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THEOREMS ON NORMAL MATRICES 
By H. SCHNEIDER (Edinburgh) 


[Received 12 September 1951; in revised form 10 December 1951] 


1. As a corollary to general considerations on matrices A, B whose 
commutator A B— BA is nilpotent, Drazin, Dungey, and Gruenberg (2) 
have recently obtained this theorem on normal matrices:f ‘Let A and B 
be normal matrices whose latent roots are «;, B; (t = 1,...,n). Then A and 
B commute if and only if the latent roots of every scalar polynomial f(A, B) 
are f(«;,B;) (¢ = 1,..., m) for some ordering «;, B;.’ 

Using a different approach I shall show that for the commutativity 
of the normal matrices A and B it is sufficient to make apparently 
weaker assumptions. In these f(A, B) will be considerably specialized 
and a condition, implied by the theorem quoted, will be imposed on the 
sum of squared moduli of latent roots only. 

Every square matrix A has a ‘polar representation’ A = HU, where 
H is non-negative definite Hermitian, U is unitary [Wintner and 
Murnaghan (7), Williamson (6)]. The matrices H and U commute if 
and only if A is normal. If A, B, and AB are normal, it is known that 
the matrices of the polar representation of A and B commute in pairs, 


except possibly the unitary pair, provided that the latter are properly 
chosen.{ This suggested that, if polar representations were considered, 
a similar theorem might be obtained for normal, but not necessarily 
commutative, matrices, whose product is also normal. 

To avoid repetition in the statement of the theorems I shall assume 
throughout that A and B are normal xn matrices with complex 
elements and latent roots «;, B; (i = 1,..., n) respectively. Also that 


A=HU=UH and B=KV=VK 


are the polar representations of A and B, where H and K are non- 
negative definite Hermitian, U and V unitary. 


+ A review of several important properties of normal matrices may be found 
in Drazin (1). I shall generally use the results of that paper without giving 
further references. 

t Wiegmann (5). The unitary polar matrix of a singular matrix is not unique. 
The proviso is necessary when the rank of either A or B is less than or equal 
to n—2. 

Quart. J. Math. Oxford (2), 3 (1952), 241-9. 
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2. My results rest on a property of matrices due to Schur (4): 
Let C be a matrix with latent roots y, (i = 1,..., n); then 


n n 
bs lys|? < > 3 leis|?, 
i=l ij=1 


and Dn =,5 lel? 
if and only if C is normal. 

I shall first prove a lemma. 

Lemma. Let C be a normal nxn matrix, whose latent roots are y;, 
(7 = 1...., 2; § = 1,..., &), where s n, =n. Let C be partitioned so that 
the k matrices C;,; in the diagonal i aed If the sum of squared moduli 
of latent roots of C;; is > lvl’ for all i, then C is the direct sum 

Fe 
CO = 01, 4+-Cygt... +Cy- 


Proof. Let S; be the sum of squared moduli of elements of C;,; and let 
S= & leasl*. 
i.j=1 
Using Schur’s theorem for C;; we have 


n 
PA < S; (@ —_ a k). 


Since C is normal, 
k ni 


s= 3 Sal <> 5, 
But S > s S;; unless C;; = 0 (t #7), when S = s S;. It follows that 
C;; = 0 i # j) and therefore that int 

Ee eT ee 


It will be convenient to denote the sum of squared moduli of latent 
roots of a matrix A by A(A). 


THEOREM 1. The matrices A and B commute if and only if the sum of 
squared moduli of latent roots of every scalar polynomial f(A)B is 


S ioe 


for some ordering «;, B; (t = 1,..., n). 








a 
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Proof. By a well-known theorem of Frobenius there is an ordering 
for which the latent roots of every f(A)B are f(«;)B;, provided that A 
and B commute. This ordering obviously satisfies the theorem. 

Now suppose there is an ordering for which 


A(f(A)B} = ¥ [fla Bil, 
for every f(A)B. 
Since A is normal, there is a unitary matrix ® such that 
D'A®D = D = diag[ ay, ag,..., X], 
where equal a; are arranged consecutively. Thus 
D = a Lt+o,1,4...to,},, 


where a; ~ a; when i Fj. 

Suppose that there are n; latent roots of A equal to «; and denote these 
now by a;, (j = 1.,..., n;). Denote the corresponding roots of B in the 
enunciation of the theorem by f,,. Let ®’B® = C, and let C be parti- 
tioned conformably with D. F 

We may construct the polynomials f(x) (¢ = 1,..., &) for which 

f(x) = 1, f(aj;) = 9, when) ¥ 2%. 


Then f,(D)C = ®'f,(A)B®, and so, by hypothesis and by the definition 
of f(x), we have 


k 
A{fi(D)O} = Mf(A)BY = ¥ |fulos) 





(> l6nl*) = 2 Io!) 
7s j=1 
Also f«D) = 0,4+...40,,4+540,,.+...+0,, 


where the O, are null-matrices, and the J; are unit-matrices, of order n,. 
Hence, if G; = f,(D)C is partitioned conformably with D, then G;,; = 0, 
when k + i, while G;,;; = C,;. It follows that the latent roots of f,(D)C 
are those of C;; together with zeros. By (1), 


A(Cy) = AfAD)C} = ¥ 1B 


But the latent roots of C = ®’BO are Bi, (7 = Lyons 43 ¢ a J...., &). 
The lemma applies and we deduce that 


0 mtg kth hl 


This is sufficient for D and C to commute. Transforming by ®, we see 
that A and B commute. 
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Suppose that a; 8;, and «;,, 8;, are orderings of the latent roots of A 
and B satisfying Frobenius’ s theorem. The latent roots of f;(A)B are 
B,, or Bi, (j = 1,..., m4), together with n—n, zeros. Hence in Frobenius’s 
theorem the latent roots of B associated with a set of equal latent roots 
of A form a unique set. This is not true in the case of our theorem. For 
suppose that a,,, 8;, and «,,, Bj, satisfy 


ni 


km k 
A(fA)B} = ¥ F [flopBol? = SS [flow NBi 
for all polynomials f(A)B. Then we must have 
Sul? = 3 1B? 6 = 1... B, (2) 
j=1 j=1 


by putting f(A)B = f,(A)B (i = 1...., k). Conversely, if «;, B;, is an 
ordering satisfying Theorem 1, and (2) is satisfied for some ordering 
a;,, B;, of the latent roots of B, then 


AHAB} = & ¥ [Naw Bol? = 3 lsadle( ¥ las) 


ni 


k ni k 4 
PP PPA 


and therefore the ordering «;,, 8;, also satisfies the theorem. I state 
these results in a corollary. 


> 








CoroLtuaRy. Let the latent roots of A be Qi, (f == 1,..., 845 6 = I,,..., B), 
where a, = x; and «, # a; when i # j; and let the latent roots of B be B,, 
or Bi, (j = 1,..., m3 7 = 1,..., k). Let A and B commute. Then, if as» Bi, 
is an ordering satisfying Theorem 1, «;,, Bj, 18 another if and only if 


Ni ni f 
> 1B;,|* 7 p |B: |? (0 a 1,..., k). 
j=1 j=1 
It is easily seen from an example that there may be various groupings 
of the 8;, into k sets of n; members each, the sum of squared moduli of 
the ith set being |B:,|?. 
j=1 
3. I shall next prove a theorem concerning a unitary matrix B. It has 
no analogue for general normal B. 
THEOREM 2. Let B = V be unitary. Then AV is normal if and only if 
the sum of squared moduli of latent roots of AV is s | cx,|?. 
i=1 











ve _ 


Ww 
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Proof. It is well known that multiplication by a unitary matrix does 
not alter the sum of squared moduli of elements of a matrix. 
Let C = AV, and let the latent roots of C be y; (¢ = 1,..., n). Then 


; n ‘ n ‘ n . 
S= >% lel? = Dd lal? = > foul? 
ij=1 ij=1 i=1 


since A is normal. By Schur’s theorem C is normal if and only if 


n 
dln? = 8, 
i=] 
and the theorem follows from S = >) | x,|?. 
i=1 


When AV is normal the polar matrices of A and those of AV commute. 
The latent roots of the unitary matrices V and UV are of unit modulus. 
Denote the latent roots of H, U, UV, by h,, u;, w; (¢ = 1.,..., m) respec- 
tively. If C = AV, then C = H.UV isa polar representation of C, and 
H and UV commute. By Frobenius’s theorem, 


and hw; = ¥i3 
whence ly;| = |a,|. 


On using Theorem 2 we obtain the non-trivial part of Corollary 1. 


n 
CoRoLLary |. The sum of squared moduli of latent roots of AV is > |«;|* 
i=1 
if and only if the moduli of latent roots of AV are |x,| (¢ = 1,..., n). 


Next let H be a non-negative definite Hermitian matrix, with latent 

roots h; (i = 1...., n). It.is known that h; > 0. By Theorem 2, 

n n 

di = 2 lvl? 

i=1 i=1 
ifandonlyifC = HV isnormal. But CisnormalifandonlyifHV = VH. 
Corollary 2 [cf. Parker (3)] now follows. 

CoroLLaRy 2. Let H be a non-negative definite Hermitian, and V be 
unitary. Then H and V commute if and only if the sum of squared moduli 
of latent roots of H is equal to the sum of squared moduli of latent roots of HV . 

It will be seen that Theorem | is a generalization of this corollary. 


4. In Theorems 4 and 5 we shall be considering polynomials /(A) 
whose polar representation is f(A) = g(H)W. The significance of this 
restriction is brought out by Theorem 3. 
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THEOREM 3. The polar representationt of f(A) is f(A) = g(H)W, where 
g(H) is non-negative definite Hermitian and W is unitary, if and only if 


|f(as)| = |F(e;)| = g(loul), (3) 
when |a;| = |a,|. 
Proof. Let D = O'A® = diag[a,,..., a,]. 
Then D= 0'HO.9'U® 


is a polar representation of D. If 
R = diag[|a|,..., ||]; 


there is a diagonal matrix Z = diag[z,,...,2z,]|, |z;| = 1,forwhich D = RZ 
is also a polar representation of D. The Hermitian polar matrix is unique, 


and therefore R= Ho. 

Suppose that f(A) = 9(A)W. 

Then f(D) = O'f(A)® = O'9(H)®.0'WO = g(R).0'WO. 

Also f(D) = diag| f(a,),..., f(a) ], (4) 
and g(R) = diag[g(|o|);---» 9(|%n|)], (5) 


while, by an argument similar to that for D and R above, it follows from 
f(D) = 9(R). 8’ WO 

that g(R) = diag[|f(a,)|,..., |f(«n)| ]. (6) 

By comparing (5) and (6), we deduce (3). 


Conversely, let us assume (3). We obtain (4) and (5), by the definition 
of D and R. Then (6) follows by (3). Thus f(D) = g(R)X = Xqg(R) is 


a polar representation, where X = diag[2,,..., x,], |z;| = 1. Hence 
f(A) = Of(D)®’ = O9(R)XO' = g(H)W, 
for W = 0X9’, 


and the theorem is proved. 

I shall use this theorem due to Wiegmann (5): ‘Let A and B be normal. 
Then AB is normal if and only if H and B, A and V, commute.’ A version 
of this theorem has already been quoted in the introductory remarks. 


THEOREM 4. et A, Band AB be normal. Then, if the Hermitian polar 
matrices of f(A mp8 A are functions of the Hermitian polar matrices of A, B 
respectively, ry. ) is normal. 


+ Since the unitary matrix W is here left unrestricted we may say ‘the polar 
representation’ meaning ‘all polar representations’. 





ti-m ae the tin 
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Proof. Let f(A)=g(H)W, UB) = m(K)X. 


By Wiegmann’s theorem, H and B, A and K commute. Hence g(H) 
and /(B), f(A) and m(K) commute. Using Wiegmann’s theorem in the 
reverse direction, we see that f(A)l(B) is normal. 

If f(A)l( B) is normal, it is easily seen that the Hermitian polar matrices 
of f(A), /(B) may not be functions of H and K. 


THEOREM 5. If f(A)l(B) is normal, the moduli of its latent roots are 
|f(a,)U(B;)| (¢ = 1,..., n), for some ordering «;, B;. 


Proof. Let f(A) = GW, lB) = MX be polar representations; f(A) 
and /(B) are normal. As in Theorem 3, we may prove that the latent 
roots of G, M are | f(«,)|, |1(B;)| (¢ = 1,..., m) respectively. Now f(A)l(B) 
is normal and therefore Gand M, W and M, G and X commute, provided 
that W and X are suitably chosen, by the version of Wiegmann’s 
theorem quoted in the introduction. Hencet 


f(A\UB) = GWMX = GMWX = WXGM. 
By Frobenius’s theorem the latent roots of GM are | f(«;)l(B;)| for some 


ordering, and the latent roots of f(A)l(B) are |f(«;)l(B;)|y;, where the 
y; (\y;| = 1) are the latent roots of WX. The theorem follows. 


5. The matrix AB is normal when AB = BA. Hence, by Theorem 1, 


if n 
Mf(A)B} = & | flo Bil? 


for every f(A), then AB is certainly normal. In Theorem 6, I shall prove 
that it is sufficient to assume rather less for the normality of AB. 
THEOREM 6. The matrix AB is normal if and only if, for some ordering 
n 


a,;, B; (i = 1,..., 2), > | f(a,)B;|* equals the sum of squared moduli of latent 


=1 
roots of every scalar polynomial f(A)B for which the Hermitian polar matrix 
of f(A) is a polynomial in the Hermitian polar matrix of A. 
Proof. Suppose that AB is normal. Then, if f(A) = g(H)W and 
= KV are polar representations, we may use Theorem 4 with/(B) = B. 
It follows that f(A)B is normal. By Theorem 5 the moduli of latent roots 
of f(A)B are thus | f(«;)8;| (¢ = 1,..., m), for some ordering a,, B;. Hence 


Aff(A)B} = ¥ | flo) (7) 


+ When f(A)l(B) is normal, then f(A)l(B) = GMWX for all polar matrices 
W and X. This is easily proved. 
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Suppose now that there is an ordering «,, 8; for which (7) is satisfied 
for every f(A) that has a polar representation f(A) = g(H)W. We have 
already proved that, if 


0'A®D = D = diag[a,..., a], 


then ®’H® = R = diag[|q,|,..., |a,|]. 

On constructing the polynomial y(x) satisfying y(a;) = |«;| (i = 1,..., ), 
it follows that (D) = R, 

and (A) = OY(D)O' = ORD’ = H. 

Since H is a polynomial in A, any polynomial in H is also a polynomial 
in A. Now let f(A) = g HW. 


We may suppose that g(H) = ¢(A), and this implies g(R) = ¢(D). From 
this equation and from Theorem 3 it follows that 


$(a;) = g(|ox«| ) = | f(a) | (¢ = l,..., 2). 


Since, of course, ¢(A) = g(H) is its own Hermitian polar matrix, 
A{g(H)B} = A{¢(A)B} = 2, |PodBil? = 2 lolol Bil? (8) 


by hypothesis. We have proved (8) only for any non-negative Hermitian 
g(H). From an inspection of the proof of Theorem 1 it is immediately 
clear that H and B commute if (8) is assumed only for the polynomials 
g;(H) defined for H in the same way as f,(A) is defined for A. These 
polynomials f,(A) used in the proof of Theorem 1 are non-negative 
definite Hermitian, being normal matrices with non-negative latent roots. 
Hence the corresponding g,;(H) are also non-negative definite Hermitian; 
(8) holds for them and the commutativity of H and B follows. 

If 4(A) = H, we have the polar representations 4(A) = HJ and 
A = HU. Hence, by the hypothesis, 


A(HB) = ¥ |aBil? 


and A(AB) = > |x, B;|?. 
i=1 


We deduce that A(AB) = A(HB), where HB is normal, since H and B 
commute. On using Theorem 2 we see that AB is normal. The theorem 


is thus proved. 
A corollary may be obtained by an argument similar to that which 


led to the corollary of Theorem 1. 
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CoroLLaRy. Let the latent roots of A be «;,(j = 1,..., mj; t = l,..., 7), 
where |c;,| = |aj| and |a;| A |aj| whent A j; and let the latent roots of B 
be B;, or Bi, (j = 1,..., m3 t = 1,..., 7). Let AB be normal. Then, if x, B,, 
is an ordering satisfying Theorem 6, «;,, B;, is another if and only if 





mi 


mi 
> |B)? = 2 |B? @ = 1... 7). 
j=1 j=1 
6. We may remark in conclusion, without proof, that the polynomials 
f{A) and g,(H) are the principal idempotent elements of A and H 
respectively [cf. Drazin (1)]. 
In this connexion, as some work in the proof of Theorem 6 indicates, 


the condition 


A{fu(A)B} = ¥ [By |® (6 = Doane B) 


is a sufficient guarantee of the commutativity of A and B. Analogously, 
the normality of AB is ensured by the condition 


A{g,(H)U B} = 3 16. ie Batt 
j= 


for some polar representation A = HU, though this requires much fuller 


amplification. 
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THE REDUCTION BY ONE OF THE NUMBER OF 
INDEPENDENT VARIABLES IN SOME SYSTEMS 
OF PARTIAL DIFFERENTIAL EQUATIONS 


By A. J. A. MORGAN (Pasadena) 


[Received 23 October 1951] 
1. Introduction 
BIRKHOFF has suggested a method whereby the number of independent 
variables of some partial differential equations can be reduced by one. 
His proposed method is illustrated by several examples (1), (2). 

The work presented herein constitutes a generalization of Birkhoff’s 
suggestion, with some essential differences, to systems of arbitrary partial 
differential equations and provides a rigorous proof of its validity. 

Let ®; = 0 (6 = 1,..., n) be a system of arbitrary partial differential 
equations. Let G, be an arbitrary continuous one-parameter group of 
transformations, of a form to be specified later, of the dependent and 
independent variables of the equations ®; = 0. It will be shown that, 
if each of the equations in the system ®; = 0 is invariant? under G;, 
then the number of its independent variables can be reduced by one. 

Two examples, one from classical elasticity and one from fluid 
mechanics, are given to illustrate the theory. 


2. The one-parameter continuous group of transformations to 
be considered 
The succeeding investigation will be concerned with a one-parameter 
continuous group of transformations of the form: 


Q.: # = f(z... 2"; a) (¢ = l,,..., 8; > 2), 

2. (1) 
Is =Solysia) (6 = 1,...,n;n > I), 

where a is a numerical parameter and the transformations 2‘ > £! form 

a sub-group Sz, of G,. Subsequently the x! and ys will, respectively, be 

identified with the independent and dependent variables of a system of 

partial differential equations. 

Consider now the y; to be differentiable functions of the 2‘ up to any 
required order. If the transformations of the partial derivatives of the 
Ys with respect to the x‘ are appended to those of (1), it can be shown 
that the resulting set of transformations also forms a continuous one- 
parameter group. The new groups constructed in this manner will be 


+ The meaning of this terminology will be made precise in Definition 4. 


Quart. J. Math. Oxford (2),3 (1952), 250-9. 
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called enlargements of the group G, and will be denoted by G¥",..., GF 
according as the transformations among the first, second,..., kth partial 
derivatives of the ys are added successively to those of G,,..., G@F*-. 

By elementary group theory (3) the groupt (1) has m+n—1 fune- 
tionally independent (when considered as functions of the m-+-n inde- 


pendent variables z',..., x”, y;,..., Y,) absolute invariants 
«a ee, | 
ae Se ae a (2) 


and it is possible to choose them so that the Jacobian 








O(Grr-+» In) _, 0. (3) 
O(Yy5--2) Yn) 
and R O(Tyy-++> Nm-1) = m— s (4) 
As" ,..., 2") 


where # denotes the rank of the indicated Jacobian matrix. 

Suppose that we now consider the ys and g; to be implicitly defined as 
functions of the x‘ and # (respectively) by the equations} 

Zq(Z*,..05 S™) = Gal Gnse00s Pops Bipoosy 2) (5) 
and %o(Z*,..., 2”) = galG,,..., Puy ¥*,..-, 2”), (6) 
where the gs are the absolute invariants of the group (1) indicated in 
(2). The following result can be shown to hold. 

THEOREM 1. A necessary and sufficient condition for the ys, implicitly 
defined as functions of x",..., x™ by the relations (5), to be exactly the same 
functions of x',..., 2” as the Gs, implicitly defined as functions of £*,..., 2” 
by the relations (6), are of the #',..., ™ is that 

tale? ,...g 2) we EP ...., 2") = al F,..., B”). (7) 
The condition (7) can be replaced by 
2q(z",...5 2) =e Bilas...) Qua); (8) 
where the ns are the set of absolute invariants of the sub-group x' > &' of 
the growp (1). 

Proof. The necessity of the condition will first be proved. The gs are 
absolute invariants of the group (1); hence, under its transformations, 
they must satisfy the relations 

Jal Hares Pgs Vi y-v-y B™) = GolGF,---: Pas #,..-» 2”) (9) 
when the y,..., Y,, v',..., # are considered as independent variables. 

+ A continuous one-parameter group of transformations will sometimes be 


referred to as, simply ‘a group’. 


t Unless otherwise indicated, Greek and Latin indices will have the ranges 


Bssces n and 1...., m respectively. 
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Now consider the ys and gs to be functions of the x‘ and £‘ respectively. 
Since, by hypothesis, the ys are exactly the same functions of the x‘ 
that the 7; are of the £*, the gs, when considered as functions of the 2‘, 
must also have the same property. Hence on combining the relations 
(5), (6), (9) it follows that 

eee x”) os If#...., z") = 2s(z?,..., 2”) 
under the transformations of the group (1). 


To prove the sufficiency, proceed as follows. As a consequence of (3) the 
relations (5) and (6) can be inverted for the ys and #; to give, respectively, 


HalZ* 005 B™) =e Digl Ss ,.005 Bigg Bi yo0s 2”) (10) 
and 9s(2*,..., 2”) = hs(Z,;--+; &.. #,..<, 2") (11) 
in some neighbourhood of the (z",...,2”) and (z!,...,@") spaces respec- 


tively. 

The gs are, under the transformations of the group (1), exactly the 
same functions of the ys and 2‘ as they are of the 9; and z‘. Hence, 
the hs must be exactly the same functions of the zs and 2‘ as the hs are of 
the Z; and z‘. But, by hypothesis, the zs are exactly the same functions 
of the x‘ as the Z; are of the @‘. Therefore, by (10), (11), and the previous 
arguments, the ys; must be exactly the same functions of the 2‘ as the 7; 
are of the z*. 

The relations (7) imply that the zs are absolute invariants of the sub- 
group S,,. By elementary group theory every absolute invariant of Sg, 
must be a function of the m—1 functionally independent absolute 
invariants 7},..-, ),—1; hence, the condition (7) can be replaced by (8), and 
this completes the proof of the theorem. 

In view of the above result, on identifying the ys and x‘ with the 
dependent and independent variables of a system of partial differential 
equations respectively, it is convenient to define: 

DEFINITION 1. By invariant solutions of a system of partial differential 
equations is meant that class of solutions of a system of partial differential 
equations which have the property that the ys are exactly the same 
functions of the 2‘ as the g; are of the =’. 

When considering invariant solutions of systems of partial differential 
equations, by Theorem 1 it is only necessary to consider the relations 
(5) or (6) to be given in the form 


Fo 105 5<0+5 Neg) = GolHas---s Far Fis-00 2), (12) 


where the functions F3(7,,..., 7,,-,) are at least in class C®. 
By (3) and the assumed continuity and differentiability properties of 
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the functions F; and g; it can be shown that the conditions of the implicit 
function theorem are satisfied by (12). Hence, the ys can be expressed as 
functions of the F; and 2‘ in some neighbourhood of the (z",..., x”) space, 
say, Vg == Agl Hy 0105 Fog B y0009 2”). (13) 


3. Conformally invariant differential forms 
DEFINITION 2. By a differential form of the k-th order in m independent 
variables is meant a function, usually in class C™ or greater, of the form 
oky ok 
(2... L™, Ys 50009 Youreees wane wean) (14) 
whose arguments are the variables 2’,..., x”, functions y,,..., y, dependent 
on them, and the partial derivatives of the ys with respect to the x‘ up 
to the kth order. 

Suppose that each of the arguments in (14) transforms under the 
transformation laws of a continuous one-parameter group with symbol V 
[(3) 28]} and numerical parameter a. Each of the arguments in (14) can 
then be considered as an independent variable of the group with symbol V. 
If there are p such arguments, there will be no loss in generality in calling 
them 2',..., 2”. 





Derinition 3. A differential form ® will be said to be conformally 
invariant under a one-parameter group G,, if, under the transformations 
of the group, it satisfies the relation 

@(Z1,..., Z?) = F(z',..., 2?; a) O(z',..., 27), (15) 
where © is exactly the same function of the z’s as it is of the Z’s and F is 
some function of the z’s and the parameter a. 

It will be said that ® is constant conformally invariant if under the 
transformations of the group with symbol V the relation (15) is satisfied 
with the F a function of a only; ® will, under the same conditions, be 
said to be absolutely invariant if the relation (15) is satisfied with F 
identically equal to one. 

THEOREM 2. Jf ® of (14) is at least in class C® with respect to each of 
its arguments, then a necessary and sufficient condition for ® to be con- 
formally invariant under a continuous one-parameter group of transforma- 
tions with symbol V is that 

V® = w(z',..., 2?) O(z',..., 2”) (16) 
for some w(z’,..., 2”). 


| The symbol of a one-parameter group is a first-order linear partial 
differential operator, and in this case 


C é 


o 
V= E2(z1, .00y z?) Fei bee HEP (zh sees 2) oz? 
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The proof of the above statement is similar to that given by Kisenhart 
(3) for the case of absolute invariance and need not be repeated here. 
The condition (16) of Theorem 2 can be replaced by 


D(21,..., 22) = ebet-2)Da(21,..., 2”), (17) 


where ®, is a general absolute invariant of the group with symbol V and 
C(z1,..., 2”) is a determinable function of z!,...,z”. The result (17) can be 
obtained by solving the first-order partial differential equation (16) by 
standard methods (4) and supposing that the solution of the equation 
V® = 0 is known. The condition (17), rather than (16), will be most 
useful in the succeeding investigation. 


DEFINITION 4. It will be said that a system of k-th order partial differen- 
tial equations D; = 0 is invariant under a continuous one-parameter group 
of transformations G, if each of the kth order differential forms ®; is 
conformally invariant under the transformations of G¥*. 


4. Systems of partial differential equations invariant under 
continuous one-parameter groups of transformations 
Consider a system of partial differential equations of the kth order in 

m (> 2) independent variables z"',..., x” and n (> 1) dependent variables 

Yy5+++5 Yn Of the form: 





ak k 

0,(21,.. L™, Ys jeoey Ynsrees xane ath) ant, (18) 

where each of the differential forms is conformally invariant under the 
kth enlargement of the group G;.. 

The question to be answered is: Can the invariant solutions of the 
system (18) be expressed in terms of the solutions of a system of the kth 
order partial differential equations in m— 1 independent variables? The 
following result shows that the answer is in the affirmative. 


THEOREM 3. If each of the differential forms ®,,..., ®,, in the system 
of partial differential equations (18) is conformally invariant under the 
k-th enlargement of the group (1), then the invariant solutions of (18) can 
be expressed in terms of the solutions of a system of the form 


o*F, oF, 
As n- ee ES, sae qk” oa] 0, 
a system of k-th order partial differential equations in m—1 independent 
variables 74,..-; Nm—1- In the above the 7's are the m—1 absolute in- 
variants of the group Sg, and the functions Fy, at least in class C™, are 


those of relations (12). 


(19) 
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Proof. By (4) it is possible to express m—1 of the x’s in terms of 
M+» Nm and the remaining x, say *x™. Hence, it is possible to write 


xt = hi(qyy.0) tm *2™) (i = 1... m—1). (20) 
By (20) the relations (13) can be expressed as: 
9g = Dig F002. Fas Bae--+> Yen~ae 8). (21) 


The kth order partial derivatives of the ys with respect to the z’s can 
now be written as 
Or F oF; 
Yojrde = Ce Peers geseg my Fs, Ni» “x, (22) 
ON; ,++-ON:, ons, 
where 
wey B xz 1... Hy Egyooey Og BE 1.0. M— 1, Fayeeey Ju = 1,0) M 
and the notation indicates that the functions Q,;,;, may depend on 
, O* F oF; 
all of the variables ae: oie iy —*? Ki, nj, and *x™, 
On;,---On:, Oni, 
The substitution of (20), (21), and (22) into each of the differential 
forms ®; of (18) yields the relations 





oF, oF. 
®,. = Bi *z™, »,,..., Py ey ee ieee! tS 23 
) ( nN Nm-1> #1 n énk Ont _, ) 


By using the condition (17) of Theorem 2 each of the differential forms 
®; can always be expressed as 
O; = eb6 5p, (24) 
where the functions s®, are absolute invariants of the group G¥* and the 
arguments of the functions of f; and 5®, are exactly the same as those 
of Ds. 
Similarly, by (23) and on using (20), (21), (22) in (24), the following 


expression is obtained P 
I Bs = €% sho, (25) 


where the functions s4,, when considered in terms of the variables of GF*, 
are absolute invariants of the group G¥* and, in particular, 


a 


oky, oky, 
Axi” a(a™)* 





5,(2" sig DP, Bisons Buyin 
OXF, OF, 
E> 0609 a 
on Onm-1 


— sbo(*™ ——_ , | ee ) (26) 


under the substitutions (20), (21), (22). The arguments of the function 
os are exactly the same as those of Bs. 


+ Each of these quantities, when considered in terms of the variables of 


GMs... GE: respectively, can be shown to be absolute invariants of the groups 
YE VE 
G —— 7) 1, 
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It remains to be shown that each of the differential forms 34, is inde- 
pendent of *z”. The following argument is used. Let the ys be any set 
of functions of the x‘ having the invariance property defined in Theorem 1. 
eid sPo _ sfio(2,..., x”), (27) 
and, under the transformations of G*, 

™ ee ha ok Re 5 
Dal Z1,..., E™, Jayooes Joayoos = ” ,z™), (28 
where the same functional notation sys, has been used in (27) and (28) since 
the ys are exactly the same functions of the x‘ as the @; are of the z*. 
The s®, are absolute invariants of the group G¥*. Hence 





s¥io(2",..., 2™ = siho(t" +» E™). (29) 

The relations (29) imply that the i are ial invariants of the group 
Sg,; theref ‘ 

-~ _—— sfio(2",..., x) ~— St o(M1s-+s Nm-1): (30) 


that is, the sy, must, by elementary group-theory, always be expressible 
as functions of the m—1 absolute invariants 7,..., )m_-1 Of Sg,. 

Similarly, since the F; are (arbitrary) functions of the 7’s, it follows 
that 





oF 8 oF 
ate( to Nisee*> Vm—-w Fy 000s Figseoes EE alee —— — so(*2™, Trreers Nm-1): 
ony Nm-1 


(31) 


All of the arguments in the 54, of the above equations can assume 
arbitrary values; hence equations (31) can be treated as identities. 
The equations (26), (30), (31) imply that 

SE o(M1s-++) Nm—1) = sbo(*™s Nys--+> Nm—a)- (32) 
But, the equations (32) imply that the 54, must be independent of *2™ 
since otherwise the 37, could not be absolute invariants of the group Sg,. 
On retracing the steps in the above argument it is seen that the 54, of 
(25) must be independent of *x”. Hence, the equations (18) can be 
expressed as 


k k 
ad | = 0 (33) 


sPo Aa Nm-1> | | Onf gees Onk,_s 
since the quantities es of (25) are not identically zero for arbitrary values 
of the indicated arguments. This completes the proof of Theorem 3. 
The solutions of the system (33) are then related to those of (18) by 
the equation (12). The solutions of (18) determined in such a manner are 
invariant, in the sense of Definition 1, under the same group which leaves 
the system of partial differential equations invariant. 
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Consider that another set of functionally independent absolute 
invariants of Sg, be chosen; say 


Ce = Sil Mis Mma)» FSi!) FO, 


and that the F; of (12) are replaced by functions G; of the ¢,. It is evident 
that the result of Theorem 3 holds; that is, the resulting system of 
equations will have the ¢; and G; as independent and dependent variables 
respectively. This system of partial differential equations is related to 
the system (33) by the transformations of independent variable 


Ci _ Fils Nm-1)- 


5. Illustration of the theory 

Two examples, one from classical elasticity and one from fluid 
mechanics, will be used to illustrate the theory developed in the 
preceding sections. 


Example I. The deflexions of plates of uniform thickness 
In the classical theory the deflexions w of constant-thickness plates 
are governed by the fourth-order partial differential equation 


otw aw Aw P(z,y) (34) 


9 ie 


Viw — 2 ili : 
a day” © dy" D 





where P(x, y) is the loading (force/unit area) on the plate, Dis a constant 
known as the plate-stiffness factor, and x, y are coordinates in the middle 
(undeflected) surface of the plate. 
The equation (34) is absolutely invariant under the continuous one- 
parameter group 
i= az, 7j =a"y, ® = au, (35) 


where a is the numerical parameter (+ 0), m, n, p are real numbers, if 


ek Bae (36) 
m m 
and P(#,9) = P(x,y). (37) 


Two absolute invariants of the group (35), corresponding to the 
functions 7 and g of (2), are 
yom and wa-Pim, 


Hence, by (12) and (36), the invariant solutions of (34) must be of the 
form w = x4f(n) with » = y/z. (38) 


3695.2.3 s 
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By Theorem 3, f(7) must satisfy an ordinary differential equation, ob- 
tained by the substitution of (38) in (34): that is 
P(n) 
D”’ 
(39) 
where the condition (37) implies that P must be a function of n. Equation 
(39) can be used to determine the deflexions of a uniformly loaded 
(P(n) = constant) semi-infinite wedge-shaped plate. The results of such 
an investigation are reported elsewhere (5). , 


$ Pf we i W., 
(8 +1 Fg AMP HV) 5+ ABM) Fg 2A Ft Of = 


Example II: The laminar boundary layer 

As a second example consider a system of two partial differential 
equations which arise in the classical boundary-layer theory of Blasius 
si Oe vn th ue P eeee, (40) 
ox oy o oy oy” 
The equations} (40) are constant conformally invariant under the 


group) #=a", g=a'y, ti=au, t=ar, (41) 
where a is the numerical parameter (+ 0), m, n, p, g are real numbers, if 

nim=%3, gqlm=} (42) 
and p= 0. (43) 


The condition (43), equivalent to uw = @, is imposed from considerations 
of the physical nature of the laminar boundary-layer. 
One set of absolute invariants of the group (41), corresponding to the 
functions 7, g,, J. of the general theory, is 
ga", ag tm, gy-O, (44) 
Hence, by (12), (42), and (43), the substitution of 
u=F(y), v= a2 tB(n) 
with 7 = yx-}, into (40) will (by Theorem 3) yield the system of simul- 
taneous ordinary differential equations{ 
dF, dk, 0 1p OF, dF, d*F, 


—37f, +f, =r. 45 
—3 tae ae i ba | dnt 2 dy dy? ( ) 

+ In practice the first of equations (40) is identically satisfied by defining a 
stream function y such that u = ob/éy, v = —éys/éx. A single partial differential 


equation is obtained with % as the dependent variable. An application of the 
above theory will yield an ordinary non-linear differential equation. Example II 
has been given in order to illustrate the theory in the case of systems of partial 
differential equations. 

{ P. Lagerstrom has derived these equations, in some unpublished lecture 
notes, by using Birkhoff’s method. 
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6. Conclusion 

The substitutions, previously given without rigorous mathematical 
foundation, used to obtain the invariant solutions of partial differential 
equations arising in applied mathematics can easily be obtained by means 
of the theory presented herein. Examples of this are given by the prob- 
lem of the boundary layer over a flat plate with pressure gradient 
produced by an external velocity of the form U = Az? (A, q real con- 
stants) (7) and by the problem of the spread of a cylindrical jet (8). 
In fact most of the known cases, including the two mentioned above, 
can be motivated by using groups of the form (41). 

The method presented above does not depend on the boundary con- 
ditions which the solutions of a given partial differential equation must 
satisfy. Thus, for a particular problem, the invariant solutions of the 
equation may not satisfy the prescribed boundary conditions. Neverthe- 
less, if from physical considerations it is suspected that an invariant 
solution will satisfy the boundary conditions, the theory developed herein 
enables one to find its form with relative ease. Once this is done it is a 
simple matter to check whether such a solution will (or will not) satisfy 
the boundary conditions. Conversely, it may be possible to obtain new 
physically significant solutions of the partial differential equations arising 
in physical theories by examining them for invariant solutions without 
initial regard to boundary conditions. 

The author wishes to express his deep appreciation for the valuable 
suggestions contributed by Professor A. D. Michal of the California 
Institute of Technology. 
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FUNCTION SPACE: HILBERT-SPACE 
CORRESPONDENCES IN QUANTUM MECHANICS 
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Let L? denote the space of quadratically integrable functions f on the 
line —oo < x < wand let § denote the Hilbert space of vectors X with 
components 2,, X2,..., subject to ¥ |x,|? < oo. It is well known that, in 
terms of any orthonormal, complete sequence of functions ¢,, ¢9,... in L’, 
the Fischer—Riesz theorem furnishes a one-to-one correspondence between 
LP and §. Let X denote the vector in § corresponding to the function f 
in L? and let @ = (Q;,,) denote the ‘coordinate’ matrix with respect to 
the system {¢,}. It is easily shown that QX is in § and corresponds to 
af whenever the latter function belongs to L?. On the other hand, it is 
quite possible, as will be shown by an example, that QX can belong to § 
even if zf does not belong to L?. For the case in which the ¢, are the 
eigenfunctions of the quantum-mechanical, linear-oscillator, boundary- 
value problem, however, it will be shown that this latter situation cannot 
occur; thus, in this case, QX cannot be in § unless af is in L?. Certain 
consequences of the above results, dealing with the moments of distribu- 
tion functions, will be obtained. Similar results are valid if Q is replaced 
by the momentum matrix P. 

Let ¢,, ¢g,... denote an orthonormal, complete sequence of functions 
on —0o < x < o for which all functions x¢, belong to L*, so that 


| |x, |? dx < 00, (1) 
and define the coordinate matrix Q by 


Q = (Vx) = ( feb, $j az), (2) 


> 


Let f denote an arbitrary function of L*, so that 


|f|2 dx < 0, (3) 


Quart. J. Math. Oxford (2), 3 (1952), 260—7. 
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and define its Fourier coefficients x,, which constitute the components 


of the vector X, by 


xy = | fot de. (4) 
The Parseval relation . 
> lee? = f [fede (<0) (5) 


implies that X isin §. Furthermore, an application of the polarized form 
of the Parseval relation yields, in virtue of (2) and (4), 


(QX)x = T Qin tn = | afd de. 


Hence, in case the function af isin L?, the vector QX must be in §; cf. (5). 

Let &,, &5,... denote the normalized eigenfunctions associated with the 
linear oscillator? so that 

&, = c, Hy(x)e*™” = (= &). 
That (1) is fulfilled is clear; moreover, the corresponding coordinate 
matrix @ defined by (2), for 4; = &;,, becomes 
Q= (Vix) ~~ 2A sae tht 85e41) (6) 
(at least to within a constant multiple which can be supposed to be 1). 
The following will be shown: 

(1) Let f be an arbitrary function of L? and define its Fourier coefficients 
x,, in terms of d;, = &,, by (4). (Then the vector X = {ax,}isin §.) Suppose 
also that Q is defined by (6) and that the vector QX belongs to §. Then 
necessarily the function xf belongs to L?; thus xf satisfies 

[ |af|? dx < 00 (7) 
(and hence QX corresponds to af ). 

In order to prove (I) it will be convenient to deduce first the following 
lemma: 

(A) Let f and g be in L? and suppose that the vectors X = {x,} and 
Y = {y,} and the matrix Q are defined by (4), 


y= | otk de, 


— © 


uv 


and (2), for d, = 


+ E.C. Kemble, The Fundamental Principles of Quantum Mechanics (New York 
and London, 1937), 370-1. 


k- 
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Suppose that QX and QY (in addition to X and Y) belong to §. Then 
each of the iterated series 
oO @ * e 2 1 eo io ss i * 
ps (>¥ Yin)t and & (>a; x;)y% 
is convergent to the same ( finite) sum. 


If (A, B) denotes the scalar product p a,b of two vectors A = {a,} 


and B = {b,}, the assertion of (A) is that (X, QY) = (QX,Y). It should 
be noted that Q is not a bounded matrix} and that the last equation is 
not a consequence of the ‘formally Hermitian’ nature of Q (namely, that 
Vix = Gy) It is clear that 

(X,QY)—(QX,Y)=lim > { DY (Qin Yj Te—Uej Xj YD] 3 (8) 

no k=1 ‘j=k+1 

hence (A) will follow if it is shown that the above limit, which certainly 
exists, is zero. To this end, one need note only that, by (6), the right side 
ad (8) sesienaoan l = 2-4 lim mY ss ey—Un41 Yn): 


no 


Since > \yn+1 Ma—Tnsr Yal < (> \Ynl? > |x,.|*)*, 
it readily follows from the divergence of the series > n-* that l = 0, and 
the proof of (A) is complete. Next, the theorem (I) will be proved. 
Suppose that f is any function of L* for which QX is in §, where 
X = {ax,} is defined by (4) for ¢, = &, and Q is defined by (6). Let V 
denote an arbitrary positive number and let g = gy denote any function 
of L* which is zero outside the finite interval —N < x < N and which, in 
addition, satisfies 
a) N 
i \g|? da = | \g|? da = 1. (9) 
Su _N 
It is clear that xg is in L? and hence that QY is in §. An application of 
(A), in conjunction with the Parseval relation, now yields 
N — a 
| afg* dx = ¥ | afé* dx | g*é, da. (10) 
=" k=1_ 7) PS 
(It is to be noted that the functions f, xé,, belong to Z*.) An application 
of the Schwarz inequality, the Parseval relation, and (9) imply 
N 


| xfg* dx 


—-N 


o 


[ afét dex 


2 
< 00. (11) 














2 oo 
<> 
k=1 


t A. Wintner, Spektraltheorie der unendlichen Matrizen (Leipzig, 1929), 237. 
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The last inequality follows from the assumption that Q@X belongs to §. 
N 
If gis chosen to be af |( | |af|? d)* on —N <2 < N,and zero elsewhere, 
wa “N 


relation (11) implies that 
N 
| |xf|? da < constant (independent of )), 
J 


7 


for all positive NV, and hence that (7) holds. This completes the proof of (I). 


Next, a theorem similar in form to (I) will be stated for the momentum 
matrix P, where 2 
P = (Py) =i | bedf de. (12) 
(Il) Let f satisfy (3), so that f is in L? and the corresponding vector X 
is in §, and suppose that f has a continuous second derivative on 
—-o<4r< O. 
In addition, suppose that X and P are defined by (4) and (12), for ¢;,. = &;, 


and that PX belongs to §. Then f' belongs to L* (and hence, as is easily 
seen from a partial integration of (13) below, PX must correspond to if’). 


It is clear that all functions €§ belong to L? and that, as a consequence 
of a partial integration and the Parseval relation, the equation 


(PX), = —t [ See dx (13) 


is valid. Hence, if g denotes any function with a continuous first deriva- 
tive on —o <x< oo and which is zero outside a finite interval 
—N <2x< N, one obtains 


N 
(X,PY) =i [ f’g* dz (14) 
aa 
N « 
and (PX,Y)=-i}> [ g*é,,dx | SeK da. (15) 
k —N —a 


It should be noted that, in the derivation of (14), use is made of the 
assumptions on g together with two partial integrations. As follows 
readily from the proof of (A), a lemma similar to (A) is valid also for the 
momentum matrix, in the case of the linear oscillator (P having a formt 
similar to that of Q in relation (6)), so that the quantities of (14) and 


+ E. C. Kemble, loc. cit. 
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(15) are equal; hence, an application of the Schwarz inequality to (14), 
together with the Parseval relation and the assumption that PX is in §, 
yields ; 


2 N 
< constant [ \g|? dx, (16) 
-N 





N 
| [ f’g* dx 
—N 


where the constant is independent of NV. The proof can now be completed 
in a manner similar to that above. One first defines 


g(x) =s'e)/ ( 7 IP dz). 


—N+e 
on some interval —N+e« < x < N—e, wheree = e(N)isasmall positive 
number such that 


EB lien 


Furthermore, it can be arranged that g(x) = 0 outside —N <a < JN, 
that g(x) is defined on the two ‘e-intervais’ so as to have a continuous 
derivative on —o < x < o, and that 





< constant (independent of N). 


N N—€ 
[ \g|?@da < [ \g|? dv+-constant (= 1+ constant), 
—N N+e 


where the constant is again independent of N. The above requirements 
are easily fulfilled (the assumptions concerning the derivative of g on 
—N+e <a < N~e being a consequence of the assumption that f” is 
continuous) and lead easily, via (16), to the conclusion that f’ is in L?. 
This completes the proof of (II). 

In order to prove the remaining assertions made in the introduction, 
it will be shown by examples that the claims of theorems (I) and (II) can 
become false if, instead of the particular linear oscillator sequence €,, £5,..., 
an arbitrary orthonormal sequence ¢,, ¢9,... is allowed. Consider the 
half-line 1 < x < o and on it the set ‘Y = {y} of all functions % which 
are zero outside a finite interval and which satisfy 


[ WWltde<oo and [pdx =o. (17) 
1 1 


It will first be shown that the system ’ is complete on 1 < 2 < o. In 
other words, the only function g on 1 < x < o0 for which 


« 


. 


[ lg? dex < and | gf* da = 0 for all % in ¥, (18) 
1 i 








is 


fu 
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is the function g = 0. Let ¢ be a constant > 1 and consider the set of 
functions 

(either sin k(z—c) or cosk(x—c) (¢ <x <c+2n), 
,;= . 
\0, otherwise, 


where k = 1, 2,.... It is clear that (17) holds and that (18) implies that 
g = constant on every interval of length 27 on 1 < x < o. Since g is 
in L?, then g = 0, as was to be shown. 

By extracting functions from ’ and finite linear combinations of these 
(which also belong to “’) one can obtain an orthonormal, complete 
sequence y,, %,..., on 1 <x < 0, satisfying (17) for 4 = y,. Let the 
domain of definition of the functions %, be extended by defining Ys, = 0 
on —o <a < 1. Now let J_,, p_,,... denote an arbitrary orthonormal, 
complete sequence of functions on —oo < 2 < 1 and define _; = 0 on 
1<2<o. In addition suppose that (1) holds for ¢, = %_,. (That 
(1) holds for the ys, is obvious.) Let now the function f of L? be defined 
to be 0 or 1/x according as —0 <x<lorl <a<o. Thenaf=1 
on 1 <2 < o and, by (17), one has 


x 


[ af¥it da = | bx dx = 0 (for all k). (19) 
J : 

Since af = 0 on —o < x < 1, it is clear that 

[ afb* dx = 0 (for all k). (20) 


Furthermore, the orthonormal sequence {¢,} = {%,, _,} is complete on 
—0o <x < ©, satisfies (1), and, by (19) and (20), is such that the 
relations . 
[ afde dx = 0 
x 
are valid. In other words, QX = 0 (the zero vector); hence QX certainly 
belongs to §, although af is not in L?. 

The construction of an orthonormal sequence {¢,} for which PX is 
in §, although f’ is not in L’, is similar but slightly more involved. An 
outline of the construction is as follows: Let f be in L? and have a con- 
tinuous second derivative on —oo0 < x < oo and be such that f’ + 0 on 


1 <a2< o. Furthermore, suppose that 
: r 
| |f'|?dx <0 but |f’|? da = o. (21) 


. 


x —@ 
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The existence of such a function is clear. Let ‘ = {i} denote the set of 
all functions, with continuous first derivatives on —oo < x < oo, and 
which, in addition, are zero on —oo <x< 1 and also outside a 
(sufficiently large) finite interval of 1 < x < oo, and finally, which satisfy 


f f'b* dx = 0. 


(In particular, all functions ’ are in L?.) Then, again, the set ‘¥ is com- 
plete on 1 < x < o. The proof follows easily in a manner similar to that 
above if one notes that the functions sin k(a—c)/f’(x), cos k(a—c) f’ (x) 
if properly ‘smoothed out’ at the end-points of the intervals 


’ 


c< 4 <c+27, 


where c > 1, are functions of ‘’. An orthonormal, complete sequence 
of functions from ‘Y can be constructed so that 


| f'dt dx = 0. (22) 
1 


Again, let {y_,} denote an orthonormal complete sequence on 
—O < z< 1 
and define #_, = 0on 1 < x < o. In addition, it will be supposed that 


each #_, has a continuous derivative y_, on —oo < x < 0, for which 


~'_,, belongs to L?. Then {d,} = {x;,.,_;} is an orthonormal complete 
sequence on —o < 2 < oo and 


S| f setae 


holds; cf. (22) and the first relation of (21). Furthermore, as follows from 
a partial integration of a relation corresponding to (13), but where &, is 
replaced by the present ¢,, one obtains 


2 
< 0 








(PX), =i [ f'¢f de. 


Hence PX is in § although f’ is not in L*. 

Actually, with slight modifications of the above procedure, one can 
construct a system {¢,} for which even PX = 0, although f’ is not in L?. 
One should arrange first that f= 0 on —oo < x2 < 1 and then that 
x 
| |1/f’|? dz < co, at least for every finite X > 1. (This latter condition 
1 


was automatically fulfilled above, since there f’ was continuous and not 
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zero on 1 < x < .) In addition, as before, f should, and f’ should not, 
belong to L?. The existence of functions f satisfying the required con- 
ditions is clear. It is to be noted that (22) now holds not only for k but 
also for —k and that consequently PX = 0. 

Appendix 1. It is well known that each of the operators x and i d/dz, 
on —o <2 < o, has a continuous spectrum, consisting of all real 
numbers, and no point spectrum. However, as is seen from the above 
discussion, it is quite possible that there exist matrix ‘representations’ 
of these operators, defined by (2) and (12), for which the number 0 is in 
the corresponding point spectrum. Thus, for proper corresponding 
choices of {¢,', the equations 9X = 0 and PX = 0 can possess solutions 
X #0in §. 

Appendix 2. It is easy to obtain a certain implication of (I). Consider 
the case of the linear oscillator, so that ¢, = &, and Q is defined by (6), 
and suppose that fis in L? (and hence, X is in §). If, in addition, QX is 
in §, then, by (I), af is in L?. Another application of (I) shows that, if, 
in addition, Q(QX) is in §, then also 2*f is in L?. In general, if f is in L’, 
X = X, (in §), and ifthe vectors X,,, = QX;, arein § fork = 1, 2,..., N, 
then the functions af, x?f,...,a*farein L?. It is then an easy consequence 
of the Schwarz inequality that the relations 

x 

[ |xl*If[tde<oo (k= 1,2,..., 2N), 

x 
must hold. Hence, if f denotes any wave function for which all vectors 
| on X\,, belong to §, then the first 2N moments of the distribution 
function c, with density | f|?, ie.do = | f|*dx, are finite. (It is to be empha- 
sized that this result has been proved in case the functions ¢, are the 
normalized eigenfunctions of the linear oscillator boundary value problem 
(¢, = &,).) Similar results for the momentum matrix P follow from (IT). 


+ Cf.,e.g., V. Rojansky, Introductory Quantum Mechanics (New York, 1946), 92. 








NON-HOMOGENEOUS LATTICES IN 
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0.1. Introduction 


Tuts paper falls into three parts. The main object of the paper is to give 
a new proof of a theorem of the author’s (8) about the lattice properties 
of unbounded convex regions in the plane. This proof is contained in 
Part I. It is much shorter than the other proof and avoids the implicit 
use of the axiom of choice; but the other proof works in n-dimensional 
space, whereas this one, in its present form, is applicable only to the 
plane. Part III of this paper contains an application of the theorem of 
Part I to the product of two non-homogeneous linear functions; the 
results are similar to those of Blaney (1). In Part II certain critical 
determinants needed in Part III are evaluated. This evaluation is more 
or less a mechanical application of Minkowski’s method, except that a 
certain lemma (generalizing a result of Mahler) has to be proved first. 


0.2. Notation, definitions, etc. 

Generally speaking, the notation is the same as in (5), (8). Small 
Roman letters denote points of the plane, small Greek letters denote 
real numbers, capital letters point-sets. The notation (x,, x.) is used for 
the two coordinates of the point 2, and, in the usual vector notation, 
Ax, x+y denote the points with coordinates (Ax,,Ax,), (%,+Y1,%2+Y2) 
respectively. The symbol |z| denotes distance from the origin, and 
V(K) is used for the area of the point-set K. 

Point-sets are combined as in the following example: 


a+aP—Q 
denotes the set of all points that can be written in the form 
ator—y (re P,yeQ); 


fr K, int K, K denote the frontier, interior, and closure of K respectively. 
The symbols U, n are employed as usual for the union and intersection 
of sets. 


Quart. J. Math. Oxford (2), 3 (1952), 268-81. 
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Definition 1. A point-set K is called symmetrical in a point wu if 
K = 2u—K; wis then called the centre of symmetry of K. A non-empty 
bounded set cannot have more than one centre of symmetry. For if 


u-~v and . . . 
. = 2u—K, K = 20-—K, 


we have K = 2(u—v)+K, and so by induction K = 2é(u—v)+K for 
every integer €. Thus K contains point-pairs whose distance apart is 
2é|(w—v)|, contradicting the assumption that K is bounded. 

Definition 2. A lattice A is defined to be a set of points of the form 
p+éx-+ ny, where p is any vector, x, y are linearly independent vectors, 
and £, 7 run over the integers. The determinant d(A) of A is the absolute 
value of the determinant 2x, y.—2_ y. 

Definition 3. K is conver if K> (J {AK+(1—A)K}. 

0<A<1 

K is a convex body if K is bounded, closed, convex, and has an interior. 

Definition 4. If K is a convex body symmetrical in u, the critical 
determinant 5(K) is defined by the relation 


8(K) = infd(A) [we A,An K = {u}]. 
We shall require two known properties of 5(K). 
If we A, and d(A) < 8(K), 
then there is a point of A, other than wu, in K. (1) 
The second is Hlawka’s theorem: 


8(K) < V(R). (2) 


I. THe NoON-HOMOGENEOUS LATTICE THEOREM 


I.1. Statement of the result 

Let R be a closed convex set containing inner points. Suppose further 
that R is not the whole plane, a half-plane, or an infinite strip. Then 
there are two distinct tac-lines of R that are not parallel. Let us call 
these lines L, and L,. If we R, the set Rm (2u— R) is contained in the 
parallelogram with sides L,, L,, 2u—L,, 2u—L,. Thus Rn (2u—R) is 
always a convex body and the numbers V(R n (2u— R)), 8(R n (2u—R)) 
are finite. Henceforth V(R n (2u— R)) will be denoted by f(u). 

THEOREM |. Let R be a closed convex set satisfying the conditions stated 
above. Let A be a lattice. If An R # 0, then there is a pointxe An R 


such that 8(R mn (2a—R)) < d(A). 
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Since 6 is very closely related to the volume V [see (2)], it is natural 
that the properties of f(w) should play an important part in the proof. 
The main result in this connexion is of some interest in itself. 


THEOREM 2. Hypothesis: uc int R; ut+vefr R; f(u+av) < f(u) for 
each « such thatO0 <a <1. 

Conclusion: f(u+-av) < (l—o*)f(u) ff0O<a <1. 

The proof of Theorem 2 depends on some inequalities about convex 
sets due to Bonnesen [cf. Bonnesen (2), Henstock—Macbeath (5)]. Only 
very special cases of these inequalities are needed, and, since the proof 
is simple, it seemed worth while to establish them from the beginning. 


I.2. Bonnesen’s lemma 

Let K, K’ be two convex bodies, with projections of length a, «’ on 
a given line 1. We assume without loss of generality that / is the axis 
of x,. Let (8,8-+-«) be the projection of K and (f’, B’+-«’) be that of K’. 
Let ¢(7) be the length of the segment in which K is cut by the line 
x, = B+ ar, with a corresponding definition for ¢’(r). 

The segment in which the body K+K’ is cut by the line 


x, = B+B'+(a+a')r 
has then a length at least 4(7)+¢4'(7). Hence 


1 
V(K+K’) > (a+a’) { [4(r)+4'(r)] dr = (a+a')|- V(K) +— v(K")}. 
0 


If A, A’ are positive, we can apply this formula to the bodies AK, d'K’. 
Since the projection of AK has length Aa, and the area of AK is A°V(K), 
we have d V 
V(AK+2'K’') > atNxa’)|* V(K)+— vik"). (3) 
a a 
From this relation it follows by Cauchy’s inequality that 
V(AK +2'K’)t > AV(K)*+-A’V(K’)t. (4) 


This is the two-dimensional case of the well-known Brunn—Minkowski 
theorem. 

Each line parallel to the x,-axis cuts K in a segment. If « denotes 
the length of one of the longest of these segments, and a’ is similarly 
defined in relation to K’, the formula (3) will again hold. This follows 
by symmetrizing in the x-axis and then applying the previous result 
[see Bonnesen (2), Bonnesen—Fenchel (3, 94-6) |. Suppose, in particular, 
that K, K’ have centres of symmetry on the axis of x,. Then no line 
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parallel to the axis of x, will have a longer section with K (or K’) than 
the axis itself has. Hence we have 


Lemma 1. If K, K’ have centres of symmetry on a line | and if «, «’ are 
the lengths of l|n K, 1 K’ respectively; then the formula (3) holds. 


Definition. Let {K,} be a set of convex bodies, associated with the 
values of a real parameter a. {K,} is called a concave system of bodies if, 


whenever y = etic h ee tp, 
then K,, > AK, +(1—A) Kg. 


If {K,} is a concave system, it follows from (4) that the square root of 
V(K,) is a concave function of a, and hence that V(K,) has left and right 
derivatives at every point [see Hardy, Littlewood, and Pédlya (4)|. The 
symbols D_ and D, are used to denote left and right derivatives 


respectively. 


LemMMA 2. Suppose that {K,} is a concave system of bodies, each having 

a centre of symmetry on the same line l. Set V, = V(K,) and let l, be the 
length of the segment ln K,. Then, if B > «, 

> a 

(B—a)D, V,, > (—21,+1,) += 


Vp. 
ly Up 


Proof. If 0 < A < 1, we have, from Lemma 1 with ’ = 1—4, 
V(a+A(B—a)) > ((1—A)l,+Alg) a—a) lay! : 
lL, lp 


l 


V. ba 
+2 M+ 00). 


i She > (—21,+1,) 7 


By letting A > 0, the result follows. 


CoroLLary. If « > B, the same formula holds with the left-hand side 
replaced by (B—a)D_V,. This result is obtained by applying Lemma 2 to 
the concave system {IL} of bodies defined by L, = K_,. 


LemMA 3. Under the same assumptions as Lemma 2, we have 
io ae 2 


Proof. V* 


x 


is concave, so by (4) [91, Theorem 111], 
D_(V,)t > D,(V,)4. 


Since D_V, = 2V? D.(V,), the result follows. 
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1.3. Proof of Theorem 2 
Let K, = Rn (2(u+av)—R). I shall show that {K,} is a concave 
system of bodies. Since R is convex, we have 


R>AR+(1—A)R if0<A<1. (5) 
Now, if y = Aa+(1—A)B, it follows from (5) that 
2(u+yv)—R > A(2(u+av)— R)+(1—A)(2(u+fv)— R). (6) 


Hence, from (5), (6), by taking intersections, 
K, > AK, + (1—A)Kg. 
Let 1 be the line consisting of all the points that can be written in the 
form u+é&v, where & varies. Let R11 be given by the relation 
Te < é < i 
where 7 may possibly be infinite. Since wu € int R, 7 must be positive. 


As in Lemma 2, let /, be the length of 1m K,. Now ln K, is given by 
the two relations 


—r<é<l, 2-1 <é < 2a. (7) 
If we set o = $(1—7), there are two cases to consider. 
Case 1: « >o. The relations (7) reduce to 
2a—-1 <§ <1, 
and we have /, = 2(1—a)|€|. 
Case 2: « <o. The relations (7) reduce to 
—7r <€& < 2a+r7, 
and we have 1, = 2(a+7)|é| = 2(1—20+a)|€|. 
The two different expressions just found for J, make it necessary to 
split up the proof of Theorem 2 to cover the following three cases. 
Case 1: o <0. Then 1, = 2(1—a)|é| for 0 <a <1. We have, by 
Lemma 2, , 
aD, Vy > (—1—a)h+-— Vy, 


a’ 
Since V, > V, for all positive «a, D, VY, < 0, so 
V, < (l—a?)h. 


Case 2: o > 0; 0 <a <a. Again apply Lemma 2. We have 


0 > DV > (—r+a) 4 


@ 
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From the first of these inequalities, since V, > 0, we have o < 7 and so, 


since oc = }(1—7), we have o <}. (8) 


Case 3: «>a>0. Apply Lemma 2 three times to obtain the 


relations: iin =e ae 1-201, 
- Ife ** ie ™ 
V. l—o 
— 7 Ye aa Was 
ee a ioe 


—2 i 1 Ce 
eT Se eee he 
l—o 1— 
By a straightforward elimination process, using Lemma 3 to get rid of 
the derivatives, and using the relation (8), we find 
inet < {1504 |p, -1 (9) 
(l—o?)K ~ 1+ 
where D = 1—5a+802— 403 = (1—o)(1—2c)? > 0, by (8). We have, 
since go < a < l, 
8c? 80? 8 
Pon le gq2_ 8% < 8o07— 403. 
l+«a l+o l+o 
Hence the numerator of (9) is less than the denominator, and Theorem 2 


is established. 


I.4. Proof of Theorem 1 
Lemma 4. Jf ue int R, and u+ave R for all positive «, then f(u+av) 


tends to infinity with «. 


Proof. Let w be a vector perpendicular to v. Then, since u € int R, 
there is a positive number e such that ut-ewe R. Let T denote the 
triangle with vertices w--ew, u+3av. Since R is convex, R > T, and so 


R 9 (2(u+av)—R) > T 2 (2(u+av)—T). 


|, which tends to 





The area of the expression on the right is 2a|v|e|u 
infinity with «. 


Lemma 5. Let uc int R, w 4 0. Then one or other of the following two 


statements is true: 


(i) f(u) > f(u+aw) for all « > 0, 
(ii) f(w) ol aacil all B > 0. 


3695,2,3 
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Proof. Suppose the contrary, 


i.e. f(u+ow) > flu),  f(u—Bw) > fu) (a, B > 9). 
; B p a 
Appl h = —_ ee ae 
pply (4) wit errr op 
Then fut > > En (u+aw +f u— Bw) 


+B +B 


> f(u)', a contradiction. 


Lemma 6. If we An int R, and 8(R nm (2u—R)) > d(A), then there is 

a point ve An R such that 
, flr) < ¥flw). 

Proof. Since 6(R n (2u—R)) > d(A), we A, there is, by (1), a point 
ut+weAn Rn (2u—R) (w+ 0). Now, if utwedA, so also is u—vw. 
Hence, after Lemma 5, we may assume, on replacing w by —w if neces- 
sary, that f(w+-aw) < f(u) for all a > 0. Then, by Lemma 4, there are 
positive values of « such that w+ aw does not lie in R. Let 8 be the lower 
bound of such a. Then u+fw ¢ fr R. Sinceu+w e R, Bisnot less than 1. 


oie 2[p] > [2]+1 > B. (10) 
Let v = u+[B|w. We have ve A, and, by Theorem 2 


flv) < 2 — (BV <f(u), by (10). 


This proves Lemma 6. 

It is now simple to complete the proof of Theorem 1. By hypothesis, 
there is a lattice-point, say uw, cint R. If (Rn (2uy—R)) < d(A), 
Theorem 1 holds. Suppose, then, that 6(Rn (2u,—R)) > Me 4 By 
Lemma 6, there is then a point u,¢ An R, such that f(u,) < ?f(up). 
If, again, 8(R a (2u,—R)) > d(A), the process can be repeated, to obtain 
successively Us, Us,..., Where 


F(Un) < (4)"F(Uo)- 
Since, by Hlawka’s theorem (2), 5(R n (2u—R)) < f(u), the inequality 
8(R nq (2u,—R)) < d(A) must hold for some wu, of the sequence. This 
completes the proof of Theorem 1. 


Il. A Certain CriticAL DETERMINANT 


Before Theorem 1 can be applied to the product of two non-homo- 
geneous linear functions, the value of a certain critical determinant 
must be known. A generalization of a lemma of Mahler’s is needed in 
order to derive the result. 
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II.1. Mahler’s Lemma 

Definition 5. Let K be a convex body symmetrical about the origin. 
Let A be a lattice containing the origin. (In Part IT all lattices will be 
assumed to contain the origin.) Then A is called geometrically critical 
if A has 

(i) no point except 0 in the interior of K, 

(ii) six points +p,, +p, +(p,+p2) on the boundary of K. 

It has been shown by Minkowski that a critical lattice is always geometri- 
cally critical, but of course the converse does not hold. The problem of 
finding the critical determinant of K is thus reduced to that of finding 
the geometrically critical lattice of minimum determinant. 

Definition 6. If K possesses the property that, if 0 <A< 1, 

peK, qe K => (1—A)p+Aqe int K, 
then K is called strictly convex. 

Definition 7. If p, p’ are two linearly independent boundary-points 
of K (i.e. p # +p’), then every point of the plane can be written in the 
form Ap+pp’. If q is a boundary-point, and q = Ap+ pp’, where A, u 
are non-negative, we shall say that q lies between p and p’. 

Now consider a geometrically critical lattice A. Let p,, p, be as in 
Definition 1. Rename the points of A that lie on the boundary of K 
as follows: 

r= MN, PitP2 = Qe» P2 = Is) 
—Pi = Pa Ps ™ To —P2 = {- 
The boundary of K is then divided into six sets, defined as follows: 
C;, consists of the points lying between q; and q;,, (¢ = 1,..., 5), 
C, consists of the points lying between q, and q,. 
It is easily verified that these sets are disjoint, apart from the end- 
points q;. 

Lemma 7. [See Mahler (6), Lemma 6, where the result is proved for 
critical, instead of geometrically critical, lattices.] Let K be strictly 
convex. Let A, be any geometrically critical lattice dividing the boundary 
of K into the six arcs C;,,..., Cg. Let A, be a geometrically crit*cal lattice 
which does not coincide with Ag. Then there is exactly one point of A, in 
each of the six arcs C. 

Proof. Since A, is geometrically critical, there are six points of A, 
on fr K. It suffices, then, to show that no two of them can lie in the 


same arc, say C\. 
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Suppose, then, that, if possible, g = Aq,+pq2, 9’ = A’: +H 'Ge are 
lattice-points on the boundary of K, and that A, yp, A’, w’ are non- 
negative. I shall show that, unless q is g, or g., the following inequalities 


hold: 0<A<1, O<p<l, A+t+p>l. (11) 


Suppose first, if possible, that A+-4 < 1. Then 
gq = Aqui t+vget+(1—A—p)0 € int K, 
by strict convexity, unless two of the three coefficients are zero, i.e. 
unless g = q, Or gg. This contradiction proves the last inequality. 
Suppose again that A > 1. Since g,—q, = —p, € fr K, we have 


n= ITT — as) +} Oe int K, 


unless q = qj. 

Now we can suppose that q, q’ are not the points g,, q. in some order, 
for then the lattice A, would coincide with Ay, contrary to hypothesis. 
Then, by what we have shown, at least one of the pairs (A, ), (A’, u’) will 
satisfy (11), and the other will satisfy (11) if we replace the strict in- 
equality signs by weak ones. Since (11) implies 1 < A+ < 2, we derive 

JAA’ < 1, [ew] <1, [A—1’)+(4—-2')| <1. 
These inequalities mean that g—q’ (€ A,) lies in the interior of the convex 


cover of q,,..., Y, and hence in the interior of K. This contradiction proves 
Lemma 7. 


er 


II.2. Application 
Mahler’s lemma will be used in the proof of the following result. 


THEOREM 3. If 0 <A < 1, the critical determinant of the set 


—Ay+A+1\y} <1 (iy <1) (12) 
is 4,/(2—A). 

Note. Without risk of confusion, I use the notation (x,y) for co- 
ordinates instead of that of § 0.2. I shall use this more ordinary 
notation from now on, since we shall be dealing with special regions. 
The proof of Theorem 3 is given only for the case 0 <A < 1. A few 
simple modifications, left to the reader, will deal with A = 0. The result 
is also true for A = 1, when the set becomes a square. 

Proof. The set (2) is symmetrical, since the defining relation is 
unaltered by the substitution (x,y) (—z,—y). It is also strictly 
convex, as can be seen by writing the first inequality in the form 


Aly|2—A+1)|y|+(—a*) > 0. 
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The value of |y| must therefore be either greater than the large zero or 
less than the small zero of the polynomial on the left. Since the sum of 
the zeros is 1+-A-! > 2, the second relation |y| < 1 implies that |y| must 
be less than the small zero, i.e. 


Iyl < [A+ {018+ et!) = 4(2), say. 


Since ¢"(x) < 0, the strict convexity is proved. 

The boundary of (2) consists of two ares, which I shall call the upper 
arc (y > 0) and the lower arc (y < 0). 

Let A, be the geometrically critical lattice generated by the points 
Po (1,0), p, (4,«), where « is the smaller root of the equation 

Ay?—(A+1)y+? = 0. 
The lattice A, also contains the boundary-points p, (—},a) and pg, 
(—1,0). The points p all lie on the upper arc. 

If A is any geometrically critical lattice, then, by Mahler’s lemma, it 
will contain points ¢,, 4, g, on the upper arc, where q, lies in the are py p,, 
gain the are p, Pe, dg in the arc p. ps. Further gg = g,+43. (Considera- 
tion of the magnitude of the y-coordinate rules out such possibilities as 
91 = —Yo: V2 = W1—J3-) Let q, be (€, n), where, since q, lies in the arc py p,, 
pais G—AP+A+In=1, F<E<1. (13) 
Let g.¢, meet the axis of x at (s, 0). Since q,q, is parallel to Oq, a general 
point on the line g.q, is given in terms of a parameter t by (s+-té, tn). 
The values of ¢ at the points gy, g, where the line cuts the upper arc are 
given by a quadratic equation obtained by substituting x = s+4é, 
y = ty in the equation of the upper arc. Since q.q, is equal, as well as 
parallel, to Oq,, the difference of the roots is unity. On deriving the 
condition for this, and using the relation (13) to get rid of all powers of & 
higher than the first, we obtain 

g(A) = 4AA2—4A¢(A+1)—2(A+ 1) +3 = 0, (14) 


where A = —87 is the determinant of A. We shall see that both the 
roots of (14) are positive, so that the positive value of A really is given 
by —sn, not +87. 

The proof of the Theorem is now completed in two stages. Write 
# = },/(2—A). Then we have 


Lemma 8. @ is less than the larger zero of g(x). 
Proof. (A—1)?(4A+1) > 0. 
Hence 4\2(2—A) < (A+ 1)?. 





278 A. M. MACBEATH 
Take square roots and divide by 4A. Then 

< (+5) < (145). | 
Thus @ is less than the mean value of the zeros of g(x). 

LEMMA 9. @ is less than or equal to the smaller zero of g(x). 

Proof. Since g(x) can be positive only if x is greater than both roots 
or less than both roots, it suffices, after Lemma 8, to prove that g(@) > 0. 
Now g(9) = (A+1){3—A—2y—2€,/(2—A)}; so it suffices to show that 

3—A—2n > 2€,/(2—A). (15) 
Neither A nor 7 is greater than unity, so the left side is positive, and 
(15) follows from the same inequality with both sides squared, i.e. 
(3—A—2n)? > 46°(2—A). 
Substitution of the value of €* from (13) reduces this to 
(A—1)?(2y—1)? > 0, 
an inequality which is true. This proves Lemma 9. 

From Lemma 9 we deduce that A > 0, except possibly when » = }. 
Corresponding to » = }, we have the geometrically critical lattice 
generated by the vectors (6, +4), which does indeed have determinant 0. 
Thus @ is the critical determinant, and the theorem is proved. 


Itt. THe Propuct oF Two NoN-HOMOGENEOUS LINEAR FUNCTIONS 
Let a > 0. Let R be the region defined by the inequalities 
zs > 0, y>0, ry >a. 
It is well known that R is convex and that R contains a point of every 
lattice [see, e.g., (8)]. If p is the point (u,v), then Rn (2p— R) is given 
by the relations 
<2 <= Qu, 0O<y < WW, ry > a, (2u—2)(2v—y) > a. 


Apply the affine transformations: 


ds a ie wad a 
y= ofita’ /(1—Z)—w(1— / 2). 


This maps Rn (2p—R) on the region of Theorem 3, with 


= (fe) fs) 
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Since, for any affine transformation 7’, 8(7(K)) = |det 7'|8(K), we find 


that 8(R nm (2p—R)) = {,/(uv)+-3va}4{,/(uv)— va}!. From Theorem 1 we 
deduce at once that 


THEOREM 4a. There is a point of A satisfying the inequalities 
ry >a, £50, y>O, — {y(xy)+3va}t{./(ry)—~va} < d(A). 


This result may also be expressed in the form 


THEOREM 4b. If k > 2 and d(A) = a(k—2)(k?—4)!, then there is a 
point of A in the set 


a < xy < (k—1)*a, x>0, y > 0. 


The result in this form is best-possible if k is an integer greater than 2. 
It suffices to prove this assertion for the particular value a = (k—2)-'; 
for affine maps of the form (x,y) > (ax, ay) will then prove the result 
for any positive a. 
It suffices, then, to produce a lattice of determinant (k?—4)! with no 
point in the set 
(k—1)? 


<7 < 


k—2 E—2 (x >0;y > 0). 


This is done as follows. Let p, q run over all real numbers that are 
congruent to (k—2)-! (mod 1). Let «, a~' be the large and small roots 
of the equation 


t?—kt+-1 = 0. 
Then the non-homogeneous lattice given by 
x= aq—p, y = p—a-q 
has the desired property. To prove this it must be shown that the 
following relations cannot hold simultaneously : 


I ta Cnr : 
7 < kpq—p?—q? < 3 (16) 
| 
Pd = 7 ; (mod 1), 
xp—q > 0, aq—p > 9. (17) 


Now the form kpg—p?—q? admits the following linear transformation 


into itself p’ = kp—4, q' = p. (18) 


This transformation preserves the property p = q = (k—2)-! (mod 1), 
and, if we write t = p/q, 7 is transformed by the rule 


7 = k—7r-!, (19) 











280 A. M. MACBEATH 


If there were a pair satisfying (16), (17), we could assume, interchanging 
p, q if necessary, that the corresponding 7 is greater than 1. In the 
interval 1 <7 < a the transformation (19) preserves order, i.e. 


Ty <%= 1 < T 
Consider the sequence 
— on . _— —} 
%=1, o=—k-1; ..., of = k—oj-4, 


This is a monotone increasing sequence tending to the limit «; so, if a 
number 7 lies between 1 and a, then it falls between a, and a,,,, for some p. 
Hence, if we apply the inverse map of (19) » times, we obtain a number 7, 
such that 1 <r, < k—1. Going back to (18), this means that, if there 
is a pair satisfying (16), (17), then there is also a pair satisfying (16), (17) 


and satisfying in addition the further relation 
1 < pig <k—-1. (20) 
We have only to show that this is impossible. (17) and (20) together 
imply that p, g are non-negative; for (20) gives p/q positive and the two 
inequalities of (17), added together, give p+q > 0. Now consider the 
behaviour of the function kpq—p?—q? for fixed g. It has its minima for 
the interval g < p < (k—1)q at the end-points, for its only absolute 
turning-point is a maximum. Hence 


kpq—p?—q*? > (k—2)q?; 


and the inequality (16) implies 





so that 2 ame 


The only possibility then remaining for qisg = (k—2)-!. Then (20) leaves 
only two possible values for p: 


kl l 


Neither of these satisfies (16) with strict inequality, and so the result is 
proved. 

Note. The forms kpg—p?—gq* have occurred before in similar con- 
nexions in the work of Segre (9) and Blaney (1). The above seems to be 
a new property, since, if k is an integer not less than 4, the result ceases 
to be best-possible if the conditions x > 0, y > 0 are removed. 





ng 
fhe 
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The lemma of Part II and the theorem of Part I suffice also to prove 
another known result. Let R be the region y > z* and let p be the 
point (u,v). Rm (2p—R) can be reduced by an affine transformation 
of determinant unity to the set 


ly|+2* <a, 
where « = u—v*. The further transformation (x, y) > (atx, ay) maps 
this on the set of Theorem 3 with A = 0. Hence 


1 
V2 


8(R n (2p—R)) = — (u—v2p!. 


Applying Theorem 1, we deduce 
THEOREM 5. There is a point of A in the set 
0 < y—a? < {2d(A)?}!. 
This result, though best-possible, is merely the first of a series of minima 
[see (7)], whose existence has not so far been established by geometrical 


methods. 
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ON AN UNSOLVED PROBLEM IN THE 
ANALYTIC THEORY OF NUMBERS 


By A. Y. FAWAZ (Cairo) 


[Received 19 February 1952] 


1. Ler pa in bs = (¢ = o+ti; o > 1), 
L(x) = > A(n), L(x) = >’ A(n) (x > 0). 


It was conjectured by Pélya (7) in 1919 that 
L(x) <0 (4 > 2). (1) 


The calculations of Pélya confirm this up to x = 1,500, and the verifica- 
tion has since been extended to x = 20,000. [See Gupta (2), (3).| In 1942 
Ingham (4) gave reasons for supposing that, on the contrary, 


lim SUP » -tL(x) = +00 (4-00). (2) 

inf 
In a recent paper | Fawaz (1)| the explicit formula for L(x) was investi- 
gated; in particular, an explicit representation of a closely related integral 
I(x) as an infinite series was obtained. This was undertaken in the hope 
that it might throw some light on the conflicting hypotheses of Pélya and 
Ingham. In this paper I propose to follow out in detail the relationship, 
suggested in the concluding section of (1), between the behaviour of L(x) 
for large x and of J() for small positive x. 

I assume the Riemann hypothesis concerning the zeros of {(s), and the 
subsidiary hypothesis that all the (complex) zeros are simple. This 
assumption does not affect any of the proposed applications. The explicit 
formula for L(x) (or L,(x)) is then [see Fawaz (1)| 


i C(2p) - 
pe ee > P+] ~ 0), 3 


where p runs through the complex zeros of {(s), the terms of the series are 
suitably grouped, and 


Quart. J. Math. Oxford (2), 3 (1952), 282-95. 
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this integral being absolutely convergent for x > 0 [(1) § 3]. We may 
write (3) in the form 


I u 7 pu x ’ : 
Fu) = a ak (¢ ) = a+ > (% eyntit  e-Ynut), (4) 
n 
where x = e“, wis real, and 
l {(2p) , , 
— 5 ’ en = > ’ ‘diel. ees +Y¥,, t, 
° &4) po'(p) oe 
where n= +1, +2, +3.,...; y, is real; y_, = —Yni Gp = X- 


If we could treat the right-hand side of (4) as if it were a finite sum, we 
could show, by Dirichlet’s theorem on Diophantine approximation (or 
by the theory of almost-periodic functions), that any value taken by it 
must be approximately repeated for arbitrarily large values of u (of either 
sign); and we could deduce that any value taken by the function 


fo(x) = 2-{ Ly(x)—1(x)} 


must be approximately repeated for arbitrarily large values of x (and 
again for arbitrarily small positive values of x). In fact, the Dio- 
phantine approximation (or almost-periodicity) can be pursued in both 
directions from any given value of wu. Thus, if we could find a value of 


u for which F(u) > 0, (5) 
we might expect to be able to deduce that 
lim sup x-*{L,(~)— I(x)} > 0, lim sup x~-*{ Lo(x)—L(x)} > 0; 
Fes r—0-+ 
or, since x~tJ(x) > 0 as x > 00 and L,(x) = 0 for 0 <2 < 1, 
lim sup a? L,(x) > 0, lim inf a? I(x) < 0. 
ru r—0+ 


Since L,(x) differs from L(x) only when z is an integer, the difference 
then being +4, the former of these two inequalities could then be written 


limsupa!L(x) >0 (#0); 


and this would contradict (1). 

Since the right-hand side of (4) is in fact an infinite series, the purely 
formal argument is inapplicable as it stands since we cannot apply 
Dirichlet’s theorem to an infinite number of y, (even if the series con- 
taining them is convergent, and we know little about this here). In this 
paper I use the Fejér kernel method, as set out in Ingham (5), to reduce 
the infinite series to a finite sum, and we are thus able to cast our argu- 
ments into a rigorous form. More precisely, I shall prove the following 
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theorem, in which a, (n= +1, +2, +3,...;a_, = a&,) is the set of 
numbers occurring in the explicit formula (4). 


THEOREM A. (i) Jf the set of numbers «, is unbounded, then 
lim inf z-? L(x) = —oo, limsupa*L(x) = +0 (x00); 
lim inf z-*I(x) = —oo, lim supa-*I(x) = +0 (#>0+). 
(ii) If the set of numbers «,, is bounded, then 
lim inf z-* L(x) < xp {Lo(x%)—1(%o)} <limsupaL(x) (#0), 
lim infa-#I(x) < xg #{I(a9)—Lo(%9)} < limsupaI(x) (x +>0-+), 
where 2, is any fixed positive number. 


From this we deduce 


THEOREM B. liminfz-? L(x) = —limsup2-J (2), 
F ea’ 4) xr—>0+ 

lim supa L(x) = —liminfa-' (2). 
ro xr—0+ 


The object of these theorems is to enable us to apply to L(x) any 
information that we may obtain about /(2) (e.g. from its representation 
as an infinite series). 


2. The proof of Theorem A (ii) will be based on (4) together with a pro- 
perty of the partial sums of the series on the right-hand side (Theorem 
1). In establishing this property we can conveniently include a proof of 
(3)—or (4)—not given in detail in (1) owing to its similarity to the 
corresponding treatment of J/,(x) by Titchmarsh (8). When we come 
to the proof of Theorem B, we shall find that we are able to deduce this 
theorem (without repeating the Diophantine-approximation argument) 
by first choosing a suitable x, and then applying Theorem A. 


THEOREM 1. Let X beany ( fixed) positive number. Then, for0 << « < X, 
> «, 0 <K, 
Yn * ey ' 
where the numbers x, and p, = 4+y, i(n = +1, +2....) are those occurring 
in (4), K = K(X) ts a constant, and T = T,, tends to infinity through a 


particular sequence. 
Consider the integral 
rv (28) 
J = —.~ | Z(s)2* de, Ze) = ——, 
~| i 
5 


taken in the positive sense round the rectangle C with vertices a— 7%, 





-—- ease 7 


of 
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2—Ti, 24+ Ti, a+ Ti, where a is fixed (0 < a < }) and 7 + o through 
a sequence 7), (m = 1, 2, 3,...) such that 


l 
Z(s) = O(T*) ($<e0 <2; It] = T= Tn); 
where « is any fixed positive number. This holds also for —1 < o < }, 
t} = 7 =T,,, by the functional equation. The existence of such a 
sequence 7), is proved (on the Riemann hypothesis) by Titchmarsh 
[(8) § 2 (2.2), and p. 20; also (9) 304 (14.16.2)]. 

By Cauchy’s theorem of residues 

J=ar'+ D> a, xP», 
yn<T 

where a, = 1/f(4), as in (4). 

Let C,, C,, C3, C, be the four sides of C numbered in the positive sense 
from the lower horizontal side, and let J,, J,, J3, J, be the contributions 


to J of these sides respectively. On C, and C, 


a =O(T*), (28) = O(T*) (a<o <2, |It| = T=T,). 
C(s8 
Also, taking X > 1 (as we may), 

jae]; = a7 < X°< KX? (agac<x2; 0<2¢ X). 
Hence, on C, and C;, 


Z\(s)2° = = $28) 2° ~ O( TT 1X2) = O(T-4+%«X2) = O(X?), 
(6) 


when 7'-> oo through the sequence 7,,,. Since C, and C, are of fixed 
length 2—a, it follows that 


J,| < A, X?, | Jg| < A, X?*. 
For s on C;, Z(s)ae = > (ny 20" 
n=1 


and this series is uniformly convergent for s on C, (x fixed). Hence we 
may integrate term by term and obtain 


in >) An) 5 (x/n)* 3, 


8 





n=1 Co 


= Ly(e)— > Ama(z,2) = Lyle)—8, say, 
n=1 
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where the notation is the same as in Ingham [(6) 75, Theorem G]. Now, 
for x > 0[(6) 75, Theorem G, (10)], 


mod 2 
| S| =< 2 (") ss Ax. 
n=1 


. Hence, if0<2< X, 
|g] < |Lo(x)|+|S| <2+A_a* < X+A, X?. 
Finally [see (1) § 3, 96] 


| J,| = ae | Z(s)a* ds\ < at | | Z(s) ds| 
“7rl | “= 
' C4 C4 
<ee. | |Z(s) ds| < A,X*, 
2a 
(a) 


since the infinite integral is convergent. 
Collecting these results, we obtain 


J| < || +] 2|+ | Js|+| a! 
< 2A, X?4+X+4A,X2+A, X*. 


Hence, comparing with the result given by Cauchy’s theorem, we finally 
deduce 


| > ont < Jagat] J| < |ay|X'4+A,X24+X+A,X* 
| YnlS 1 


This proves Theorem 1. 

To deduce (3), or (4), from our formulae, take a fixed x > 0 and 
let T’ = T,, > o in the identity 

J,+4+43+5, = J. 
First, J, > 0 and J, > 0 by (6). Next, J, > L,(x), since the series S is 
uniformly convergent (with respect to 7') for 7’ > 0 and each term tends 
to 0 as T' + «0 [(6) 75, Theorem G, (10) and (9)]. Finally J, + —I(x) by 
the definition of J(~). We deduce (3), or (4), with the right-hand side 
of (4), for example, interpreted as the limit of 
at > (a, evn“¥+a, e-Yaut) 
O0<yn<T 

when 7' > oo through the sequence 7, 


m* 


3. Proof of Theorem A (i) 

The proof of Theorem A is divided into two parts corresponding to 
parts (i) and (ii) of the theorem. In part (i) of the theorem we use a 
classical argument due to E. Schmidt [see, e.g., Ingham (6) 91-2, 








a" 


i. —_~_— ee ao 





Ww, 
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Theorem 33]; in part (ii) we adopt the standard method of Ingham (5), 
but with the k and k* (or f and &) used the other way round. 
Let C be any real number. Then 


fMar+Oe yp 12, © sry 


. get ast sf(s) ' s—} ‘ 
i 
We write this as ; 
| Lilt) dx = Zs) (0 > 1). (7) 
a = 
i 


Now suppose that L(x) > 0 for, say, x > X, > 1. Then, by Landau’s 
theorem, the line of convergence of the integral on the left-hand side 
of (7) is o= 4. For the singularities of Z,(s) are at s = } (unless 
C Yo), $ = 0, and all the complex zeros of f(s), and the first of these 
we meet when we continue Z,(s) to the left along the real axis is the point 
s= 4} (ors = 0). It follows that equation (7) is true foro > $. Hence, 
foro > 3 and any real f, 


x 


x Xo x 
ZAo ! ti) | Lol) 4 < | Ll) | 4, _ | — da+ | = dx 


gorti xe 
Xo 
Xo Xo 
Lohe)|\— Soh) dx+Z Ao) < | 2|L,(x)| dx+Z,(c) = K+Z,(o), 
xe 
i 1 
where K = K(C, X,) = K(C) is a constant. We thus have 
ZAo+ti)| <K+Z,(c) (o> }). (8) 


Select a zero p = p, = 4+-y,,1 of {(s). In (8) take t = y, and multiply 
through by o—}. This gives 


(o—})|Zclo+yi)| <(o—})K+(o—})Zo(2) (0 > 9). 
Now let o> 3+. We then have 
|a&n| < a+C. 
The number C is at our disposal. Suppose it chosen so that 
C < |a,|—ap. 


Then the assumption that L(x) > 0 for all x sufficiently large leads to a 
contradiction. Hence L,(x) < 0 for some arbitrarily large x; that is, 


xtL(x) << —C 
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for some arbitrarily large x. Thus, when x > 00, 
lim infx-? L(x) < —C. 

Since C may be taken as near as we please to |a,|—« , and since this 
number can, by hypothesis, be taken as large as we please, it follows that 
lim infz-? L(x) = —oo (x00). 

In the same way, by considering the function — L(x)+ Cz! and taking 
C < |a,,|+a9, we show that, when x > oo, 
lim supx*L(x) = +00 (4-0). 
This completes the proof of part (i) of Theorem A as far as the operation 
x —> 00 is concerned. 
For x + 0+, we consider the integral 


| 1(7) dx 
x 
0 


* Lz) 


stl 
1 


and express the integral in terms of the zeta-function (e.g. by Mellin’s 
inversion formula). We have, for x > 0, 


rey = 2 [ Pea = oN E) “as e<a<d. 


dx 





in place of 





27i J sf(8) 2ri J sl(s)\x 
(a) (a) 
1 C(2s) 
Also Omi | eke) ds (0<a<}) 


(a) 
is absolutely convergent, and the integrand has a bounded derivative, 
and is therefore continuous and of bounded variation, in any finite 
interval of t. Hence, by the inversion formula of Mellin [see, e.g., 
Titchmarsh (10) 46, Theorem 29], 


(2s) file),  f,(l\,- 
ste) = wade = [1()orae (0 <a < }), 
0 0 





these integrals being absolutely convergent since I(x) = O(x*) (x > 0) 
for every fixed ain 0 <a < }[(1)§3, 96]. Now let c be any real number. 
Then 


co) 1 

oe ee, ae ee a 
een a | arte ene 
1 0 
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The integral on the right is a bounded function of s in 0 <6 <a < }. 
For, taking 0 < a < b, we have J(u) = O(u*) for u > 0, so that 


1 1 Be 
| (7) 1dx of [ 2-a+0-1 de] = of | teil az) = O(1) 
. 0 0 ; 


for alls inb <o < }, since b—a > 0. 
We write this as 


| J(x)as-1 dx = ¥(s) (0 <a < }). (9) 
1 
Now suppose that J,(x) > 0 for, say, x >a) > 1. Then, exactly as 
before, Y(o+ti)| <k+Y¥,(c) . (0 <0 < }), (10) 
where k = k(c, xy) = k(c) is a constant. Select a zero p = p, = $+y,1 
of Z(s). In (10) take ¢ = y,, multiply through by $—o (> 0), and let 
o—>4—. We then have 
|a,| < —ap+e. 
The number c is at our disposal. Suppose it chosen so that 
c < |&,,|-+-ap. 
Then the assumption that J,(z) > 0 for all x sufficiently large leads to 
a contradiction. Hence J,(x) < 0 for some arbitrarily large x. Since c 
may be taken as large as we please, we conclude that 
lim infatI(1/x) = —oo (x00); 
or lim infa-!J(x) = —oo (x>0-+). 
In the same way, by considering the function — J(1/x)-+-ca~! and taking 
¢ < |a,|—a», we show that 
limsupa-*I(x) = +00 («#>0+). 
This completes the proof of part (i) of Theorem A. 
4. Proof of Theorem A (ii) 
Write 
folx) = a-{Lo(x)—I(x)} (@ >0), Flu) = fyle". 
1—|: |jz| < 1), in 3y\° 
Let k(x) —_ 2d (|2| >= ) k*(y) ans nee ay . 
| Oo (jz| > 1), dy 





[ k(a)e**vi da = k*(y). (11) 


. 
— 


3695 .2.3 U 
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Now take a positive number 7», and write 


k,(u) = #(*). 
7 \1 
Then ki(y) = k*(ny). 

We start from the formula 

wt+n 

| k, (u—w)Fy(u) du 
_s w+n “ns mee 
=a f ky(u—w) du+ ¥ | ky(u—w)(ay, eM +a, eM) du. 

n=1 ° 
w-7 w-7 

This equation is obtained by multiplying the explicit formula (4) for /,(w) 
by k,(u—w) and integrating term by term from u = w—7 tou = w+7. 
The process is legitimate since /,(u—w) is integrable in the sense of 
Lebesgue and, by Theorem 1, a certain subsequence of the sequence of 
partial sums of the series occurring in the formula for F,(u) is boundedly 
convergent in the range [w—7, w+7]. Lebesgue’s theorem then shows 
that the term-by-term integration is legitimate, if the resulting series is 
interpreted as the limit of the corresponding subsequence of partial sums. 
But then, in case (ii), this series is in fact convergent in the ordinary sense, 
since the nth term is O(y,?), as may be seen at once from the form 
(12) given below. 

Now substitute uw = w+y; 

7 

| Fyly)Fo(o+y) dy 

My , 
=% | k,(y) dy+ 
—7 

Since k,(y) = 0 for |y| > 7, we may replace the limits of integration on 
the right-hand side by —oo, 00. Hence 


o io 
. KE (y)(n evdotui + a, e-vdo+unt) dy, 
v= . 

—9 


H “0 , , 
| k,(y)Ky(w+y) dy = x% k(O)+ & (on eyamt + a, e- Yat) KE (yn) 
Be 
. be ie .. (sin $ny,,\? 
= %+ Oy CYnM + OL, C~¥n" ( : ) . (12) 
° bs rn 


We now replace the infinite series on the right-hand side of (12) by the 
corresponding finite sum extended over those y, for which 0 < y, < 7. 
Since «, = O(1), by hypothesis, the error will be 


12, !l Far) |= ee 2 va) = Ca) 


yn>T yn>T 


n=1 








} 
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where the constants of the O’s are absolute. [See, e.g., Ingham (6) 70, 
Theorem 25b.] We thus have 


7 
} Aly) Footy) dy 


Ui 
= aot Deets aye nen (indin) oflog 
O0<ya<T $Y n T'1? 
oie +0(Fes (13) 
where 


S(w) = Sp,(w) = a+ (a, e741 & ¢ ae ena 
21Yn 
Now take an integer g > 3, and choose, by Dirichlet’s theorem on 
Diophantine approximation, a number U = U(T’,q) in the range 


q<U <qrrn (14) 


0<yn<T 


so that 
UY = Iy+by (In integer: ify] < 23m = 1,2,..M(D)}, 


2r q 


Then, for n 1, 2,..., N(7’) and all real v, 


etyAU +0i__etynvi| — | et(ynv+2m¢,)i__ ety, vi < 27d, | ed 2a 
q 
Hence 
: a) ] 
S(U+v)—S(r)| < > Jay] = o( bs 1) = - o(=e"), 
O0<yn<T q q O<ynS<T q 


(15) 
for all real v, since «,, O(1) and N(7T) = O(T log T). Hence, putting 


w = U+u, and w = up, in (13) and combining, we have 


7 log T' 
| k,(y){Fy(U +u9+y)—Fy(uy+y)} dy = S(U +) — Siu) +O tT ) 


U] 
= o(=—8 *\+0(F5 ) (16) 
q Ty 


We now choose q, 7 to make this small when T' is large, say 
log T' 
n= 1p Fi - q = dr = |Tlog?T| (T large). 


Then the right-hand side of (16) is 0(1) as 7’ > 0, say less than $e in 


modulus for all 7’ > 7, = 7T)(e) where « is any fixed positive number. 
Nj LT =) 
Since . 


| k,(y) dy = k*(0) = 1, 


| 
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this gives 
a Ay 7 
| AWPU +u+y)—F(uo+y)} dy < | kq(y)be dy 
a 7 
(y = ny; U = U(T,q) = U(T); T > TN). 

Since k(x) > 0 for all x, it follows that the expression inside {} on the 
left-hand side is less than }¢ for some y in [— 7, 7] and also greater than 
—t}e for some y in [— 7, 9]: 

F(U+ugt+y) < K(upt+y)+3e 
for some y in [—7, y], and 

K(U+uyt+y’) > £(uoty’)—t< 


for some y’ in{—7, |. Substitute y = u—up, y’ = wu’ —uy; 


F(U-+u) < F(u)+4e (17) 
for some u in [u)—7, U+7], and 
F(U+w’) > Biw’)—4e (18) 


for some w’ in [up— 7, Ug+7]. 

Since F,(w) is a continuous function of u except at isolated points, we 
may suppose that Fj(w) is continuous in the range [w»)— y, uy+ y] except 
possibly at uy. This is justified if 7 is small enough; that is, 7' large 
enough. Consider first the case in which Fj(w) is continuous at wu). Then 
we shall have Fale) Be < Fy(u) < Fy(ug)-+4e 
for all wu in [uy—7, u+7], if » is small enough. This combined with 
(17) and (18) gives F(U-+u) < F(up)-+e 
for some u in [u— 7, Ut], and 

Fy(U+u') > Fo(uo)— 
for some w’ in [u—7, U+7]|. Now choose U = Up, say, to satisfy the 
Diophantine-approximation condition and so that, in addition, U+u 
(or U-+-u’) is large and positive for all uw (or w’) in [uy—7, Uy+7]. This 
choice of U is possible if 7’ > T, = T,(e,u,), since U > gq. We then have 
F(U-+u) < Fy(u)+¢ 
for some arbitrarily large U+-u, and 
Fy(U-+w') > Fo(u)—e 
for some arbitrarily large U+-u’. Hence, keeping « fixed and making 
T — oo, we deduce 
lim inf Fj(u) < Fo(u9)+e, lim sup /j(u) > Ky(uy)—e (uu > oo). 
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Since « may be taken arbitrarily small, it follows that 
lim inf Fj(u) < Fo(up), lim sup F(u) > Ku) (uo); 
that is, lim inf (uv) < Ky(u>) < limsup F(u) (uo). (19) 


To deduce the corresponding inequalities for the lower and upper limits 
as u—> —oO, we now pursue the Diophantine approximation in the 
opposite (negative) direction. Thus, going back to (13), we now put 


w = —U-+u, and w = u, and combine. Hence, arguing as before, we 
have, in place of (17) and (18), 

F(—U+u) < Fy(u)+4e (17)’ 
for some wu in [u)—7, U+y], and 

F(—U+w') > Fy(u')—d€ (18)’ 


for some w’ in [u)—7,%)+7!. Assuming that F,(w) is continuous at up, 
we conclude, in the same way as before, that 
lim inf F(u) < Fy(uy) < limsup A(u) (uw > —oo). (20) 
Now suppose that F),(w) is discontinuous at the given point vw). Then 
Fy(Up—) < Foto) < F(Uot+) or Fo(to—) > Ao(Uo) > Fo(uo+)- 


Hence, since F,(w) is continuous except at isolated points, we can find 
u, and u, sufficiently near to wu» and on opposite sides of it such that 
F,(u) is continuous at wu, and uw, and also 


Fy(Uy) < Fou) < Fo(u2). (21) 
Now (19) and (20) are true for any uw, at which F,(u) is continuous. They 
are therefore true with wu, replaced by either u, or u,. Applying (19), for 
instance, with uw, replaced first by u, then by u., we have, when u — 00, 


lim inf F,(u) < Kj(u,) < limsup Aju), 
lim inf Fj(u) < Fy(up) < limsup Fj(u). 
These, combined with (21), now give 
lim inf F,(u) < Fy(u,) < Ky(up) < K(ue) < limsup F(u) (u— oo). 


We thus have 
lim inf F,(v) < Fy(u») < limsup Fj(u) (woo). 
Similarly (20) gives 
lim inf F,(u) < F(u,) < limsup H(u) (u—> —oo). 


This proves that (19) and (20) are true also in the case in which F,(w) is 
discontinuous at wu»). Hence they are true for any given Up. 
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Since e-*“J(e") > 0 asu>oo 
and Le“) =90 (u< 0), 


we deduce, putting x = e“ and x, = e“ in (19) and (20) and using the 
definition of F,(w), the desired inequalities as stated in part (ii) of the 
theorem. This completes the proof of Theorem A. 


5. Proof of Theorem B 
Let F(u) be the function defined in § 4, and let 


. su A_ . su A, ‘ 
lim Fy(u) = ” lim oe Fale) = ‘ (possibly +00). 
We have to prove that 
A. = A,, A. = A,. (22 


If the set of numbers a, is unbounded, the results are certainly true 
by Theorem A (i). We may therefore suppose that the «, are bounded. 
Now consider the left-hand equation of (22). If A, > A_, we choose a 
number B so that A, > B > A_. Then, since B > i_, we have 


Fu) < B 


for some arbitrarily large negative u. Select one such value of wu and 
call it uy. Then, by Theorem A (ii) in the form (19) (left-hand half), we 


have <8; 


and this is impossible since B was chosen to satisfy A, > B. If, on the 
contrary, A, < A_, we have a similar contradiction by Theorem A (ii) in 
the form (20) (left-hand half). Hence the only conclusion is 


A =A,. 


Similarly, using the right-hand halves of (19) and (20), we can prove 
the right-hand equation of (22). This completes the proof of Theorem B. 

The author is greatly indebted to Mr. A. E. Ingham for the suggestion 
of the problem and also for much valuable discussion and help. 


REFERENCES 
1. A. Y. Fawaz, ‘The explicit formula for L,(x)’, Proc. London Math. Soc. (3) 
1 (1950), 86-103. 
2. H. Gupta, ‘On a table of values of L(n)’, Proc. Indian Acad. Sci., Sect. A. 
12 (1940), 407-9. 
3. —— ‘A table of values of Liouville’s function L(t)’, Res. Bull. East Panjab 
Univ. (1950), 45-63. 





the 
the 











Noy 


ON THE ANALYTIC THEORY OF NUMBERS 295 


A. E. Ingham, ‘On two conjectures in the theory of numbers’, American J. 
of Math. 64 (1942), 313-19. 
‘A note on the distribution of primes’, Acta Arithmetica, 1 (1935), 201-11. 
— The Distribution of Prime Numbers (Cambridge, 1932). 


. G. Pélya, ‘Verschiedene Bemerkungen zur Zahlentheorie’, Jahresb. deutschen 


Math.-Ver. 28 (1919), 31-40. 


. E. C. Titchmarsh, ‘Some properties of the Riemann zeta-function’, Quart. 


J. of Math. (Oxford), 14 (1943), 16-26. 
- The Theory of the Riemann Zeta-function (Oxford, 1951). 
— Introduction to the Theory of Fourier Integrals (Oxford, 1937). 





A PROOF OF BLASCHKE’S THEOREM ON THE 


REULEAUX TRIANGLE 
By H. G. EGGLESTON (Swansea) 
[Received 27 February 1952] 


It is well-known that of all plane convex sets of constant width d, the 
Reuleaux triangle, bounded by three circular ares of radius d, has the 
least area. There are four proofs of this result due respectively to 
Lebesgue [(3) 132], Blaschke (2), Fujiwara (4), and A. E. Mayer (5). 
In this note a new proof is given which, like Lebesgue’s depends upon 
the fact that about any curve of constant width d may be circumscribed 
a regular hexagon of width d [(3) 131]. 

Let I’ be a convex plane set of constant width d circumscribed by a 
regular hexagon A BCDEF of width d, and suppose that the notation is 
such that the order of ABCDEFA is in the counter-clockwise sense of 
rotation about the hexagon. Take O, the centre of the hexagon, as the 
origin and the line through O parallel to A B as the initial line. Let H(6) 
be the support function of I and p(@) the radius of curvature of T' at the 
point where the support line makes an angle @ with the initial line. In 
measuring this angle the support line is supposed to be directed in the 
counter-clockwise sense of rotation about I’; p(@) is zero at a vertex of ’. 
Similarly define H,(@), p,(@) with respect to the Reuleaux triangle I, 
which can be inscribed in A BCDEF to touch it at B, D, and F. 

If two points of [ lie on a circle C’ of radius d, then the whole of the 
smaller arc of C’ joining these two points also belongs to ['.¢ If we apply 
this fact to the ares BD, DF, FB, bounding [, we see that 

A(6) > H,(@) (4x <0 < 32; or tm <8 < fn; Or 32 <8 < 2z). 
Similarly from a consideration of the Reuleaux triangle obtained by 
reflecting T, in O, it follows that 

H(0+7) > H,(@) (47 <0 < 3a; or tm <8 < $m; Or fn SO K< 2). 

The area of [is 


27 
b | p(6)H(4) do 


0 


= (f+ f+ | \e@H0)+9(0+)H0+m) ao 
iw yn = 

>a [++ + Fo + p(0-+7))H,(8) a8. 
ja 7 37 


tT See (1) 82; Lemma 1, Corollary 2 or 5. 
Quart. J. Math. Oxford (2), 3 (1952), 296-7. 
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BLASCHKE’S THEOREM 
But p(@)+ p(@+7) = d.+ Thus this last expression is equal to 





” 
EK ( iv ti r) 
1 1 
2) | + | + | j 27h (9) dé 
hor 7 Sar 
27 
= 4 px(0) H,(6) 48, 
1e 0 
1e and this last integral is the area of J}. 
O This proves the result. 
). + See (3) 128. 
n 
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ON INFINITELY MULTICOHERENT SPACES 
By A. H. STONE (Manchester) 
[Received 28 February 1952] 


1. Introduction 

THE ‘degree of multicoherence’, r(S), of a connected topological space § 
oo Geteee Sy r(S) = supb,(A n B), (1) 
the supremum being taken over all pairs of closed connected sets A, B 
such that AU B = 8S. Here b,(X) denotes the number of components 
of X less one (or oo if this number is infinite). If now r(S) = ©, it is 
natural to ask if the bound in (1) is attained, i.e. if closed connected 
sets A, B exist such that A U B = S and b,(A n B) = ov. If this is the 
case (as it is with the most familiar examples), we shall say that ‘r(S) 
is attained’. Spaces for which r(S) is not attained present a certain 
interest since in them watered-down versions of many properties of 
finitely multicoherent spaces will persist.t I shall show below (Theorem 2) 
that r(S) is always attained if S is a one-dimensional Peano space, and 
shall give an example (§ 6), after some preliminary theorems, of a two- 
dimensional Peano space S for which r(S8) is not attained. 


2. Open sets 

It is known [see (3), 403] that in (1) we may take the sets A and B 
to be open instead of closed (provided that S is normal and locally con- 
nected). The corresponding question of ‘attainment’ is easily settled, 
and we dispose of it first. 


THEOREM 1. If A and B are open connected subsets of a Peano space & 
such that AU B = S, then b,(A n B) < oc. 
Suppose the contrary. Since 

Fr(A) n Fr(B) c (S—A) n (S—B) = 0, 
there exist disjoint open sets U and V containing Fr(A) and Fr(B) re- 
spectively. Let {C)} be the (infinite) collection of components of A n B. 
Each Fr(C,) meets both Fr(A) and Fr(B), and C\ is connected; hence, 
for each A, there exists a point p,¢C,—(U UV). There exists a convergent 
infinite sequence (say) p,, > q (n = 1, 2,...). Thus 
qe(An B)—(UUV)cANnB, 
+ For example, if r(S) is not attained, and A is any subset of S with b,(A) < «, 


then (if S is locally connected and normal) all but at most a finite number of the 
components of S—A will have connected frontiers ; cf. (5). 


Quart. J. Math. Oxford (2), 3 (1952), 298-306. 
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and therefore g € C, for some A. The open set C), must then meet infinitely 
many distinct components C,,, which is impossible. 


CoroLLary. If G,, G, are open subsets of a Peano space such that their 


frontiers are disjointt and b,(G,)+b9(G.) < 00, then bo(G, N Gy) < 00. 


For it clearly suffices to prove this assuming G', and G, to be connected. 
But [(3) 401] there then exist connected open sets A > G,, BD G,, such 
that A U B = S, and G, n G, is both open and closed relative to A n B; 
and the result follows. 


3. One-dimensional spaces 
THEOREM 2. If S is a one-dimensional Peano space, and r(S) = ©, 
then r(S) is attained. 


The proof requires the following two lemmas; for the first, see (2), p. 214. 
The proof of the second is easy, and is omitted. 


Lemma 1. If T isa regular curve, and « > 0 is given, there exist regular 
curves H,, E,,..., LE, such that |J E; = T, 8(E;) < ¢, Fr(E,) (relative to T') 
is a finite set, the sets Int(H;) are pairwise disjoint, and no three distinct 
sets E;, E;, E;,, have a common point. 

Lemma 2. If E is any subset of a connected topological space T, and K 
is a connected subset of E which contains Fr(E), then (T—E)U K is 
connected. 

Now suppose that S is a one-dimensional Peano space for which 
Theorem 2 is false. I first show that S is a regular curve. To do this, 
consider any true cyclic element C of S. If M and N are any continua 
such that MU N = C, we have 

b(M nN) < «; (2) 
for there exists a monotone retraction (say) f of S onto C, and, if (2) were 
false, the sets f-1(M) and f-(N) would be continua covering S and 
intersecting in infinitely many components, contrary to hypothesis. 
Now let p be any point of C, and let U be any neighbourhood of p. 
Since p is not a cut-point of the Peano space C, we readily construct a 
continuum Fc C such that pe C—FcU. [See for example (6) 50.] 
Since further C is one-dimensional, we can cover F by a finite number 

+ This argument clearly applies to any normal, countably compact, connected 
and locally connected topological space S. It is easy to see from examples that 


these conditions cannot be greatly relaxed. 
t The condition Fr(G,) n Fr(@,) = 0 can be weakened to 


Fr(@,) n Fr(@,) n Fr(@, 0 Gy) = 0; 


this can be proved by refining the argument in (3) Theorem 8. 
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of connected open sets W,,..., W,, (relative to C), each meeting F and 
such that p ¢ C—W, and W; n (C—W,)iszero-dimensional. Let W = (J W, 
and take V to be the component of C—W which contains p; then 
peVcU, V is open relative to C, V and C—V are connected, and 
V—V is zero-dimensional. But, from (2) applied to V and C—V, V—V 
must now be finite; and this shows that C is a regular curve. This holds 
for every true cyclic element of S, and therefore [(6) 83] S is itself a 
regular curve. 

Since r(S) = oo, there exist continua A,, B, such that A, U B, = 8 
and b,(A, n B,) > 1. Choose e, positive and small enough, and apply 
Lemma 1 to cover S by regular curves E,,, Ejp,..., L,,,, with diameters 
less than e, and with the other properties specified in the lemma. Write 
C, for the union of the sets Z,; which meet A,, and similarly D, for the 
union of the sets H,; which meet B,; clearly C,, D, are continua covering 
S and such that b,(C, n D,) < oo. Further, if «, was chosen small enough, 
we have C, n D, = H,, U Aye, where H,,, Hy, are closed and non-empty 
and p(H,,, Hy.) > ¢,. (In fact, H,,, Hy, will be in preassigned neighbour- 
hoods of sets similarly splitting A, 9 B,.) Now, from Theorem 7 of (4) 
at least one of the sets E,;, say E,,, satisfies r(#,,) = 00. If £,, is not in 
C, 9 D,, we ‘join’ it to C, n D, by a ‘chain’ of sets E,,;: that is, we find 
a sequence of distinct sets E,,, E,;,, E,;,,.... Ey;,, of which every two 
consecutive members meet, and the last of which meets C, n D,. Take 
such a chain with as few members as possible, and let J, denote the 
union of its members. Then J, meets only one of H,,, H,, (for only the 
last ‘link’ Z,, | can meet either, and it is too small to meet both); and, 
on writing X, = C, U J,, ¥, = D,U Jj, we have that X,, Y, are continua 
covering S, and 1 <b,(X,n Y,) << o. Further, X,Y, contains the 
regular curve E,,, which we rename S,; thus r(S,) = 00, and Fr(S,) is 
finite, say consisting of the k, points p,), Py2,---, Px, 

The foregoing is the first stage in an inductive construction; the general 
process will be sufficiently illustrated by the next stage. Since r(S,) = ©, 
there exist continua A,, B,such that A, U B, = S,andb,(A,n Ba) > 2hy. 
Choose e, > 0 small enough, and apply Lemma | to cover S, by regular 
curves E,,, Ey9,..., Ey, With 5(E£,;) < €,. As before, let C,, D, be the 
unions of the sets #,,; meeting A,, B, respectively; then C,, D, are con- 
tinua such that C,U D, = S,, and C,n D, is a union of 2k,+1 closed 
non-empty sets H,;(1 <j < 2k,+1) every two of which are at a distance 
exceeding ¢, from each other. At least one of the sets E,,;, say E,,, has 
r(£,,) = 00; and this we join to C, n D, by a connected chain J, of sets £,; 
such that J, meets only one set H,;. Similarly each of the points p,;, 














ON INFINITELY MULTICOHERENT SPACES 301 


(1 <k <k,) is joined to C, n D, by a chain L,,, of sets E,; meeting only 
one H,;. Write 
X, = C,U UY Ly ¥, = D,U UU Ly: 

then X,, Y, are again continua such that X, U Y, = S,, and X, n Y, con- 
tains both E,, and Fr(S,). Moreover, X, 9 Y, = (C, 0 D,)U ZU U Lex: 
and, since J, U LJ L,;, can meet at most k,+1 of the sets H,,, it follows 
that k, < b)(X,9 Y,) < «. The set F,, is renamed S,; its frontier (say) 
F, relative to S, is finite, and, since Fr(S,) c F, U Fr(S,), it follows that 
Fr(S,) (relative to S) is also finite, with (say) k, points po},..., Poj,,; and 
so on. In this way we obtain a sequence of regular curves S,,, with 


S=828,5.., and r(S,) = 0, 


and sequences of continua X,, Y,, such that X, U Y, = S,_,, 


n? 
X,NY,28,U Fr(S,_,), and k,_, <6(X,N ne < ©, 
where k,, is the number of points in Fr(S,) (n = 1, 2,...). 

Now define recursively P, = (X,—S,)UX,, Q, = (4—S,)U%,, 
P= (Py-1—Sn) YU Xn, Qn = (Qn-1—Sn) U Ynys. It is easy to show 
by induction that P,, and Q, are closed, P, U Q, = S, P,N Qn > Sri 
and (using Lemma ue that P,, and Q,, are connected. Further, we have 


a n Qn = Po 1 n Qn- 1 —Int(S,,)} U (Xnss n Yn41)- 
Now, one of the components of P,_, Q,_, contains S,; the other 
components of P,_,9Q,-, are components of P,_,n Q,-,—Int(S,) 


which cannot meet X,,,9 Y,4,; and X,4,9 Yass has at least k,+1 
components, at most k,, of which can meet P,_, N Q,-,—Int(S,). Hence 


bol P, n n Qn) > b of Fo-t n Qn-1) )+1, 
from which we readily obtain b)(P,, n Q,) > n. 

Clearly P,,_, > P,, (for X,,4, cS, c X,, c P,_,), and similarly Q,,_, > Q,; 
hence, defining A = [) P,, B = [) Q,, we have that A, B are continua 
and AU B= S. Further, Hs is easy to see (by induction over m) that, 
if m >n, then P,,n Q,, > (P, N Q,)—Int(S,,,); thus P, n Q,,, and so 
also An B, meets each component of P, n Q,. Thus bo A n B) > n for 
every n, so that b,(A n B) = o; and this contradiction establishes the 
theorem. 


4. Infinite-dimensional spaces 

In this section I shall obtain a class of infinite-dimensional spaces S 
for which r(S) is infinite but not attained; a two-dimensional example 
will be derived later (§ 6). First I recapitulate briefly some results, 
mostly due to Eilenberg (1), on which the proofs will be based. 





302 A. H. STONE 


Let S be any continuum (for simplicity), and consider the family S'™ 
of (continuous) mappings f of S in the circle S! of complex numbers of 
unit modulus. If f, ge S1*, and, if for some X c S there exists a (con- 
tinuous) real-valued function ¢ defined on X such that 


f(x) = exp(i¢(x))g(z) on X, 


we write f ~ g on X; in particular, if f(z) = exp(i¢(x)) on X, we write 
f~1on X. Defining f.g by (f.g)(x) = f(x)g(x), where f, g e S™* (so 
that f(x) and g(x) are complex numbers), we see that S?* is an abelian 
group,t and that the mappings which ~ 1 on S form a subgroup, 
denoted by P(S). The factor group S1*/P(S) is denoted by B,(S); it is 
an abelian group without torsion. If X, Y are closed subsets of S such 
that X U Y = S, the mappings {: S > S! which ~ | both on X and on 
Y form a subgroup P(X, Y) of S™; the factor-group P(X, Y)/P(S) is a 
subgroup of B,(S), and its rank is denoted by p(X, Y). The least upper 
bound of p(X, Y), for all pairs of closed sets covering S, is denoted by 
p(S); if S is a Peano space, then p(S) = r(S). Finally, if X, Y are con- 
tinua such that X U Y = S, then p(X, Y) >6b,(X n Y). Thus, if S isa 
Peano space such that r(S) is infinite and is attained, then certainly p(S) 
is also ‘attained’, i.e. X, Y exist with p(X, Y) = oo. Hence the following 
theorem settles the question of this paper so far as infinite-dimensional 
spaces are concerned. 


THEOREM 3. Jf S is the Cartesian product of a sequence of (non- 
empty) continua S,,, then p(S) = sup p(S,,); and, tf each p(S,,) is finite and 
p(S) = 00, then p(S) is not attained. 


We require the following lemma. 


Lemma 3. If S=[]S,, then B,(S) is isomorphic to the restricted 


n=1 


direct product II B,(S,,). 
n=1 


To set up the isomorphism, consider first a mapping /,,: S, > S!, and 
define fx: S—> S! by fi(2,, 2,...) = f,(%,); it follows trivialiy that, if 
fn ™ Gn on S,, then f* ~ g* on S, so the class {f,,} € B,(S,,) is assigned 


+ Though all groups which arise here will be abelian, it is convenient to write 
them multiplicatively. 

t This theorem and its proof are fairly immediate extensions of results of 
Eilenberg’s [(1), 166, 168]. I give the argument, since it will be needed in § 6. 
There would be no difficulty in extending Theorem 3 to products of arbitrary 
many continua. 
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aunique class {f*} €¢ B,(S). Totheelement F = {{f,}, {f,},...} of [] B,(S,), 
where f,, ~ 1 (on S,) for all n > N(F), we assign the class 


OF = {fifz..-f'n} € BS); 


clearly @ is a homomorphism of B,(S,,) in B,(S). 
To define the inverse mapping, pick a fixed point = (&,, 5,...) in S, 
and, for each f: S — S!, define f,,: S, > S! by 


fn(®n) _ f(é Saye En-pv lp nti-e)3 


from the uniform continuity of f it follows that, if n > N(/), the set 
f,(S,) is small, so that f, ~ 1 on S, if nm > N [(1) 156]. If f ~g on S, 
it follows at once that f,, ~ g, on S,, and on setting 


af} = {th} th}, tiv} © TT BulS,), 
we see that w is a homomorphism of B,(S) in [] B,(S,). 

It is a routine matter to verify that w@ is the identity mapping. To 
show that @m is also an identity mapping comes to showing that, if 
fe S*® and f”: S > S' is defined by 

I "eas Bae) = [bnnkantutae > Ga.J* = fo) 
so that f" ~ 1 forn > N, then f ~ fif?...f% on S. Consider f!?: S > S! 


| where f12(x,, %o,...) = f(X1, Le, &gy---sEns---); we have f1? ~ fif? on the 


section S, x (5, €3,...) and also on the section €, x S, x (3,...), and there- 
fore [(1) 157] f12 ~ f1f? on S. This argument can be repeated, and 
gives by induction that f1f?...f% ~ f!*. on S, where 


f™-"t., Le.) = | ee tn, Ens me 3 


Here N can be arbitrarily large; if it is large enough, the uniform con- 
tinuity of f gives f ~ f!*--’ on S, whence Oam{f} = {f}. Thus 6 and aw are 


isomorphisms, as required. 


5. Proof of Theorem 3 

Since S can be projected onto each S,,, we trivially have p(S) > p(S,,) 
and so p(S) > sup p(S,,). Now consider the case in which sup p(S,,) = 0, 
but each p(S,,) is finite; we must show that p(S) is not attained. Suppose, 
on the contrary, there exist closed sets A, B such that A U B = S and 
the group Y = P(A, B)/P(S) has arbitrarily many (linearly) independent 
elements. From Lemma 3, Q, as a subgroup of B,(S), has an isomorphic 
image wQ in the restricted product [] B,(S,,). Choose an element 


FO {ff seo {fC} of wQ different from the unit element. I shall 
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show that wQ must also contain an element G® +1 of the form 
{1,1,..., 1, g¥,,},-.-}, Le. with {99} = 1 for alla < N. 


N 
For otherwise the homomorphism 7: wQ— [| B,(S,,) defined by 
1 


T{{ fi}. {fo}.---} = {{fi}.--..{fy}} would be an isomorphism into, so that Q 
would be isomorphic to a subgroup of B,(Z) where Z is the subset 
S, x SX... xX Sy X (Ey41,---) of S; and it is not hard to see that, in this 
isomorphism, each {f} € Q corresponds to the class {f| Z} (of f restricted 
to Z). Thus, since f~ 1 on A and on B, Q would be isomorphic to a 
subgroup of P(A n Z, Bn Z)/P(Z), giving p(Z) = oo. But [(1) 168] 
p(Z) = max{p(S,), p(S2),..-, p(S,)}; 

which is finite; and this contradiction shows that the desired element @® 
of wQ exists. 


N 0 
Now write X = fT &,, ¥Y = [J &,; 
1 N¥1 


then X x Y = S, and the mapping class 6F° contains a mapping F 
of X x Y in S' such that F(x, y) is independent of y € Y and, considered 
as a mapping of X in S!, is not ~ 1 on X. Similarly 6G contains a 
mapping G: X x Y > S! which is independent of x € X and is not ~ 1 
on Y. From [(1) 167], F and @ cannot be ‘2-compatible’+ on X x ¥, 
which contradicts the fact that 6F° and @G® are in Q; and this contra- 
diction proves that p(S) is not attained. 

Finally, if sup p(S,,) < 00, an entirely similar argument proves that 
p(S) < sup p(S,,). 

Remark. Essentially the same argument also shows that, in Theorem 3, 
p(S) is not attained even if some of the factors S, have p(S,,) = ©, 
provided p(S,,) is not attained for any of these factors. 


6. A two-dimensional example 

Theorem 3 can be used to derive a class of two-dimensional Peano 
spaces 7' for which p(7’') is infinite but not attained. For simplicity we 
consider only a special case. Let G,, be a linear graph formed from n 
simple closed curves having a single common point €,, but otherwise 


disjoint. Then p(G,) = r(G@,) = n. Let S =] G,; then S is a Peano 
1 


space and, from Theorem 3, p(S) is infinite but not attained. Now let 
T be the subset of S consisting of all points x = {x,} such that x, = &, 
for all but at most two values of n. It is easy to see that 7 is a Peano 
space, so that p(7') = r(7'); further, 7 is the union of a countable family 


+ Mappings f, g: S ~ S! are ‘2-compatible’ on S if S = AUB where A, B 
are closed and both f and g ~ 1 on both A and B. 
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of tori, and so is two-dimensional. There is an obvious monotone mapping 
of 7 onto each G,,; hence r(7') > n for each n, so that r(7') = 0. We 
have only to prove that p(7’) is not attained. 

The first step is to prove that B,(S) and B,(7’) are isomorphic in a 
natural way. This comes to proving the following two properties. 

(1) Each f: 7’ — S! can be extended to a mapping f*: S > 8'. 

For the uniform continuity of f shows that, if N is large enough 
(depending on f), f = exp(i¢)g where, for x = (a, %p,...) in T, 

g(x) = f(®4,-++5 tN, Engi) 
and where ¢ is a continuous real-valued function on 7’. There exists a 
(real-valued) extension ¢* of ¢ to S. To see that g can also be extended 
to S, it suffices to consider the finite product G, x ...x Gy. Since each G, 
can be regarded as formed by n segments with their ends all identified, 
the product G, x ... x Gy can be regarded as a cell-complex in which the 
cells are products of segments (with identifications on their boundaries); 
and g is defined on precisely the set of two-dimensional cells of this 
complex. Since the homotopy groups of S! vanish for dimensions > 2, 
g may be extended first to the 3-cells, and then to the cells of successively 
higher dimensions; finally g is extended to G,X...x Gy, and so to a 
mapping g*: S > S!. We merely define f* = exp(i¢*)g* on S. 

(2) Let f*, g* be mappings of S in S', and let 

f=f*|T, g=g*|T. 
If f ~g on T, then f* ~ g* on S. 

It suffices to show that, if f* = 1 on 7, then f* ~ 1 on S. Here 
again we may replace S by the cell-complex G,x...x Gy, N (=N(f)) 
being large enough, and have f* = 1 on the two-dimensional cells. It 
is not hard to show by induction that f* ~ 1 on the set of cells of 
dimensions not exceeding n (n = 2, 3,..., NV). 

Thus the correspondence between {f*} € B,(S) and {f} € B,(7’) given 
by f = f*| 7 is an isomorphism onto. It has the further property: 

(3) f* and g* are 2-compatiblet on S if and only if f and g are 
2-compatible on T. 

For, if f * and g* are 2-compatible on S, they are trivially 2-compatible 
when restricted to 7’. Conversely, suppose that f and g are 2-compatible 
on 7’. By a theorem of Eilenberg [(1) 164] the mapping (f,g): 7’ > S! x S? 
is ‘inessential’, i.e. f ~ f’ and g ~g’ on T where (f’,g’)(7’) is a subset 
of S!x S! which is homeomorphic to the graph G,. Call this subset G}; 

+ As defined in the footnote on p. 304. 
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since G also has zero homotopy groups in dimensions not less than 2, 
the same argument as before shows that (f’, g’) has an extension (f’*, g’*) 
mapping S in Gc S'x S81. Thus [(1) 164] f’*, g’* are 2-compatible 
mappings of S in S! which extend f’, g’ respectively. From (2) above, 
f'* ~f*andg’* ~ g* on S, so that f* and g* are also 2-compatible on S. 

Now suppose that closed sets A, B exist such that A U B = T' and the 
group Q = P(A, B)/P(T) has arbitrarily many indepertdent elements. 
Then Q, as a subgroup of B,(7'), has an isomorphic image in B,(S8), and 
so in [] B,(@,,); and the argument in § 5 derives as before two mappings 
F, G of S in S! which correspond to elements of Q and which cannot be 
2-compatible on S. But F| 7 and G| 7 are 2-compatible on 7’; hence 
(3) above is violated, and so r(7’) is not attained. 
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PATH GEOMETRY AND CONTINUOUS GROUPS 
By D. D. KOSAMBI (Bombay) 
[Received 31 March 1952] 


THIS note attempts to show the relationship between topological groups 
and the differential geometry of path-spaces. 


1. The paths of a path-space are defined as the real solutions in 
R*(z!,...,2") of the system of ordinary differential equations: 
t'+a!(x,z,t) = 0 (a = dzx'/dt, x' = dz*/dt). (1.1) 
For a proper geometrical exploration, it would be necessary to have ¢ an 
arc-like rather than an absolute time-like parameter; also, to have just 
one path through each direction at every regular point. The tangent 
vector & in Euclidean geometry is taken to define the same direction as 
az for any constant non-zero factor a, which means allowing for a scale 
factor int. The paths (1.1) must, therefore, have a restricted form which 
remains unchanged? for # = at+-b: that is, 
¢'+at(x, 2) = 0, Gat /ot = 0, ef = a'—}o’,, = 0; = (1.2) 
a semicolon and a comma in subscript indicate partial differentiation 
in and in x respectively. Besides requiring this restricted form of the 
path-equations, we further demand that through any two sufficiently near 
points of the base-space there should pass one and only one path. This 
means assuming sufficient differentiability and some Lipschitz con- 
ditions;{ for any x, the a’ are not much more general than quadratic 
homogeneous polynomials in #. 


Definition. A point of R” belongs to (the manifold or base-space of) 
the restricted path-space of equations (1.2) if through the point there 
passes just one solution in each direction and if any two such points with 
max|x—z’| < €« are connected by one and only one such path, for « 
sufficiently small. 

The manifold possesses a norm, say max|x—z’|, and a corresponding 
topology that has no direct relation to the paths. We proceed to endow 
it with a special topology as follows. The arcs are paths or continuous 


+ This quite, obvious result was first given by J. Douglas, Annals of Math. 
29 (1928), 143-68. 

t P. Hartmann and A. Wintner, American J. Math. 73 (1951), 149-72, and 
other references given there. The methods extend to our case. 


Quart. J. Math. Oxford (2), 3 (1952), 307-20. 
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curves made up of a finite number of path-segments. The new neighbour- 
hood of a point P consists of all points that can be reached by paths 
emanating from P such that any two points of the neighbourhood, 
including P, are joined by one and only one path. Starting the paths 
from P at t = 0, a neighbourhood may then be defined by t < 7. We 
consider only connected portions; any point that can be reached from an 
interior P by an arc, but which has not a complete path-bundle attached 
to it, belongs to the boundary. 

A fixed point, strictly speaking, would also be a path, for z = 2,4 = 0 
satisfies (1.2). We exclude these by not permitting all initial # to vanish 
simultaneously at P, by some condition like ¥ (#})? = 1 (which will later 
be replaced by g;;%}%, = 1). Then the coordinates of every point in the 
neighbourhood of a fixed P are determined with reference to those of P 
by means of t and n—1 of the initial direction-components, being in fact 
single-valued differentiable functions of n parameters; the parameters 
may, conversely, be expressed as such functions of the local coordinates. 
For the lens-space common to the two overlapping neighbourhoods of 
two sufficiently near points we have two different coordinate systems; 
the change from one to the other is described by differentiable functions 
of the coordinates themselves in either system. Therefore, by a classical 
result due to Whitney,t we have proved 


THEOREM 1.1. The base-space of any connected restricted path-space is 
a differentiable manifold, and may therefore be endowed with a (positive- 
definite) Riemann metric. 


This existence theorem gives no direct relationship between the path- 
equations (1.2) and the metric; the equations themselves need not (as 
they stand) be the extremalst of any variational principle at all, let alone 
geodesics of a particular metric. However, some indication of a ‘best’ 
choice and a special procedure is given by 


THEOREM 1.2. The neighbourhood of a point of any restricted path-space 
may be regarded as a neighbourhood of the identity in a topological group. 


Proof. We identify the fixed point O with the identity of the group. 
A group product law for two points P, Q in the neighbourhood is defined 
by taking QP as the point obtained on the path starting at t = 0 through 
P,in the same direction at P as for the path OQ at O; the value of t is 
to be taken as that needed to reach Q starting at t = 0 from O. This, 

+ H. Whitney, ‘Differentiable manifolds’, Annals of Math. 37 (1936), 645-80. 


t The equations of variation of the path-equations must be self-adjoint; ef. 
D. D. Kosambi, Quart. J. of Math. (Oxford), 6 (1935), 1-12. 
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naturally, implies fixing of the parameter ¢ in advance over the whole 
neighbourhood. The inverse of P is seen to be obtained by going back- 
wards on OP through the origin, i.e. replacing t by —t. The associative 
law holds, for the product depends only upon the order of the points, the 
neighbourhoods being chosen so small that any pair of points in the 
locality determines the path-segment joining them. 

A global extension of the local correspondence depends upon the 
original specification of the parameter, and the existence of a suitable O 
for the identity. Compactness or some such assumption simplifies the 
boundary-point question; but more than one point of the space may 
represent the same element of the whole group. 


2. The converse is more fruitful. 


THEOREM 2.1. The connected component about the identity of any locally 
Euclidean topological group with suitably differentiable underlying manifold 
is a restricted path-space. 


Proof. An actual process of realization may be set up thus: with the 
identity as origin O, we choose a (local) system of coordinates which maps 
some neighbourhood of the origin into a corresponding neighbourhood 
of Euclidean space, H,,. Close to O, take a point P and generate a curve 
segment by multiplying with P (under the group law, say always from 
the right) the group-image of every point of the straight line segment in 
E,, joining OP. This is extended by operation upon the curve segment 
with its new end-point P?, and so on. Letting P approach O along the 
fixed line segment, we obtain the path as the limiting curve through the 
origin for the fixed direction. This corresponds to a one-parameter semi- 
group, the rest of the group being obtained by operating with respect to 
P-', thus continuing the path through the origin. 

Every one-parameter group is Abelian and, with suitable choice of the 
parameter, a translation (because of the exponential mapping) in that 
parameter. Assuming that this has been done, differentiability implies 
that the path has a tangent, whence the continuation through the origin 
is smooth at O, i.e. in the same direction but opposite sense. 

The bundle of paths through O is now transferred to every point Q of 
the group manifold (we shall consider only the connected portion, but 
this causes no loss of generality) by taking the product with Q, always 
in some order fixed in advance. The identity of all the one-parameter 
groups (paths through O) may simultaneously be placed at t = 0; but 
there still remains an arbitrary scale-factor along each path, which can 
vary with the direction. However, the translation parameter may be 
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fixed along one chosen path, and the principle of continuity applied to 
restrict the others, t being required to vary continuously as P varies in 
any manner. Moreover, the scale factor is the same for any half-path and 
its continuation through O. These ¢ values are then transferred from the 
neighbourhood of O to that of Q, along with the path-bundle, by the 
group-product. The general parameter ¢ thus restricted still admits 
transformation, without changing the loci that give the paths; in par- 
ticular, it always admits the general transformation ¢ = at+-b (a + 0). 
Sufficient differentiability, assumed by hypothesis, means obeying 
second-order differential equations, which can always be solved for the 
highest derivatives, so that the paths may be described by equations 
of the restricted form (1.2). This completes the proof. 

The process of standardizing the parameter ¢ may be continued still 
further to obtain a Riemann metric for the group-manifold. The com- 
ponents of the metric tensor are calculated at O by identifying the 
parameter t in the limit with the distance s in each direction. Then 
this limiting process (and therewith the tensor g;;) is transferred to each 
point of the manifold by the group product. This makes the new parameter 
t, now completely determined at every point along every direction and 
by integration along every arc, the arc-length in terms of the Riemannian 
metric tensor g,;. We observe that the triangular inequality holds for the 
shortest distance between two points in terms of the Riemann metric; 
it holds therefore with small arcs consisting of paths. Now any simple 
curve joining OP, P near O, may be approximated by arcs made of joined 
path-segments, whence the path OP cannot be longer than any other 
curve. This gives us 


THEOREM 2.2. A Riemannian metric tensor g;; may be specified for the 
connected portion about the identity of the group manifold, the paths then 
becoming the geodesics of this metric. 


For portions not connected, one cannot speak of ‘distance’ along an 
are between any two points, but there is no other difficulty so long as 
some connected region exists about O. The essential process is that of 
determining ¢ as a path-parameter over the entire manifold, and this is 
not necessarily unique, even apart from the fact that for a non-Abelian 
group we obtain two different sets of paths according to the order of 
operation. 

The mechanism of transference from O to any other point of the 
manifold depends essentially upon the fact * rat the connected topological 
group is also a simply transitive group of ite on collineations ; hence of its 
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path-space. For a one-parameter subgroup, we should have a suitable 
generating vector u‘(x) and the expansion 


= : P re, , asa 
& = X+7u'(%)+ > Xu'+... (X = w'd/ea’). (2.1) 


On the other hand we have another formal expansion along any path: 


9 


# = 23+) +- = a'(2%9,%o)+.... (2.2) 
For any tensor 7';;, (2.1) leads to the formal expansiont 


WJ 


— Tj+cLT 5+. DT ;+-... (2.3) 


LT, = WT 54TH GAT Ute 


i 


Our derivation compels us to equate the parameters ¢ in (2.2) and + 
in (2.1) to the are distance, provided that the generating vector u‘ has 
been suitably chosen. This distance being preserved by transference, we 
have Lg;; = 0 (the equations of Killing). This does not mean that the 
numerical value of the components g;; are the same at every point, but 
that the components at any given point are determined in terms of those 
at the origin by the relationship between the coordinate systems at the 
two points g;; = g,, Ft F§, where the coefficients F} equal éx*/éz/, and 
also 0z'/éx/. This proves 


THEOREM 2.3. The relationship between the coordinate system at any 
point and that at O completely determines the components of g;; at that 
point in terms of those at O, by the tensor law under change of coordinates. 


In particular, the coordinate system at O can always be taken as 
Euclidean, by suitable linear transformation, if necessary, to give g;;(0) 
the correct form. The theorem then reflects again Whitney’st{ result 
for any differential manifold, where g;; is determined by the relationship 
between the local coordinate system mapping the neighbourhood into 
the Euclidean hypercube. 

The differential geometry of the group, which, because of differen- 
tiability, has a Lie algebra of infinitesimal collineations defined by the 
vector solutions w‘(x) of the equatiors of variations of the paths, can be 
determined in terms of the Lie constants of structure thus: 


+ D. D. Kosambi: Proc. Nat. Acad. Sci. (U.S.A.) 35 (1949), 389-94; Quart. J. 
of Math. (Oxford) (2) 2 (1951), 244-57; 3 (1952), 1-11. K. Yano: J. Fac. Sci. 
Tokyo, 6 (1949), 1-75; Groups of Transformations in Generalized Spaces (Tokyo, 
1949). t Loe. cit. 
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THEOREM 2.4. The mixed Riemann-Christoffel curvature tensor Ri,, is 
given at O by ef,,e%,, where the e’s are the Lie constants of structure of the 
group; the determination of the curvature tensor elsewhere is by transference, 
LRig = 9. 

Proof. The final portion of the theorem follows from Lg;; = 0; or 
from the fact that the paths are preserved, and the tensor may be 
calculated directly from the path-equations. For the first part, the 
definition of the constants of structure is by 

ur. a Wp, +— Up “=D ap = = cfg Up. (2.4) 
The u} are the n generating vectors of the group, whence det|w}| cannot 
vanish; L,, is the corresponding Lie operator. For our special choice of 
parameter, at O, L,, = é/és*, the directional derivative in the ath direc- 
tion at O. If we normalize the vectors at the origin, making the u‘, into 
direction cosines, this operator would be u‘, V;, where V; is the covariant 
derivative. Finally, the ath coordinate direction may be taken along ui, 
which then has at O the value 5‘. At this stage, alternating é?/@s%ds? 
at O amounts to alternating L Lg and also to alternating two covariant 
derivatives. The result of the latter operation upon any vector A° is 


(V;V.—V, V;)Aé = Rigg’. (2.5) 
For the vector A‘, we now choose one of the group generators, say wu}. 
With (2.4) applied twice and uw? = 8! in the final result, the Gussvom 


follows at once. 
The properties of the curvature tensor 


Rit Rin = 9, Rig + Rigs + Rij, = 0 (2.6) 


follow immediately from those of the Lie constants, namely 


Cay +eyp = 0, Ch. Coy + By Cpa t Cre ep = 9. (2.7) 
We further obtain the following consequences of (2.7), 
Rie = 9, Ria = Rin (2.8) 


which are necessary for the existence of a metric. Yet the choice of a 
particular coordinate system at O entails one relation between the metric 
tensor at O and the’ constants €By» from 


Yir Rha +91 Rin = 9, (2.9) 


which is then transferred to all points of the space; this is an identity 
for every Riemann space as well as the integrability condition} of the 


+ L. P. Eisenhart, Non-Riemannian Geometry (American Math. Soc. Collo- 
quium Pub. VIII, New York 1927), § 29. 
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partial differential equations g,,, = 0, i.e. the vanishing of the covariant 
derivative for the metric tensor. The group generators ui, may always 
be replaced by linear combinations with constant coefficients, which 
then means transforming the eg, like the components of a tensor in the 


Greek indices. 


3. The results of the previous section determine the differential 
geometry of a given differentiable group manifold. Some further steps 
may be taken, as for example identifying eg, with the components of 
the Cartan torsion tensor at the origin; in that case, the Riemannian 
curvature is determined by the torsion.t Moreover, it can be shownt 
that an antisymmetric component can be assigned to g,; whose exterior 
derivative is a constant multiple of the fully covariant torsion tensor. The 
total curvature tensor vanishes when such torsion is permitted, so that 
we have spaces with teleparallelism; but it simplifies the geometry to 
take the space with zero torsion though with non-vanishing curvature. 

The metric allows overall invariants of the manifold and therefore of 
the group to be calculated by known methods,§ such as extensions of the 
Gauss—Bonnet formula. If the group is a direct product, the manifold 
will necessarily be a product space, with a corresponding reduction of the 
metric into several portions each involving only such coordinates and 
their differentials as do not enter into any other portion. 

Though the group determines a Riemann space, the Riemann space 
does not necessarily determine the group uniquely. For example 


The manifold of an Abelian group is flat, so that its metric may be 
taken as Euclidean, and conversely. 


The direct result is obvious, but the converse is more complicated 
algebraically. If the curvature tensor vanishes, we have only eby Coa = 9, 
which does not immediately lead to e{g = 0. At first, we need only have 
eg = 0 for all a, 8, if p > m; also for all p with a, B < m where m < n, 
so that we have an m-dimensional subgroup all of whose operators 
commute with those of the entire group; and m cannot be zero. Taking 


+ The first step seems to have been taken by L. P. Eisenhart, but the real 
development is due to Cartan and Schouten, cf. L. P. Eisenhart, Continuous 
Groups of Transformations(Princeton 1933), 192-207; bibliography, 294-5. 

t D. D. Kosambi, Bull. American Math. Soc. 55 (1949), 90-4. 

§ That is, for closed Riemannian manifolds: C. B. Allendoerfer and A. Weil, 
Trans. American Math. Soc. 53 (1943), 101-29; S. S. Chern, Annals of Math. 45 
(1944), 747-52; Proc. Nat. Acad. Sci. (U.S.A.) 30 (1944), 269-73 ; Annals of Math. 
46 (1945), 674-84. S. Bochner, ibid. 49 (1948), 379-90 and 50 (1949), 77-93. 
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this m-dimensional Abelian subgroup as translations in the first m 
coordinates, we see that the rest cannot be made up of rotations. Two 
infinitesimal rotations in the coordinate directions (7j), (jk) combine to 
give (7k), so that there are not enough disjoint rotations and, if there are 
at least two such, there must be a third in the group, and one component 
of the curvature tensor will not vanish. Moreover, these cannot contain 
any of the translation coordinates, because that subgroup permutes with 
all operators of the n-group. Thus the entire n-group consists of transla- 
tions. The argument could have been shortened by recalling that we must 
have a simply transitive n-parameter subgroup of Euclidean rigid 
motions. Another simplification would be to note that the flat space is 
its own tangent space; then G. Birkhoff’s theorem applies,+ to the effect 
that the original space is the homogeneous space G/S, where G is the 
whole group of isomorphisms, S the (local) rotation group. 

Starting with a group and its manifold, the example shows us that 
there exist cases where the group itself is a proper subgroup of its own 
group of automorphisms. For example, the manifold of the Abelian 
group, asa Euclidean space, not only admits the group itself, translations, 
but also, in the coordinate system chosen, a group of rotations preserving 
the metric; while the path-space admits the general linear group, pre- 
serving the paths (straight lines) though not the metric. We have really 
a special case of 


THEOREM 3.1. If the constants of structure obey 
, 1 v v v 


the group-manifold is an isotropic Riemann space; hence the group itself is 
an n-dimensional subgroup of an 4n(n+1)-dimensional subgroup of the 
full projective growp in the n coordinate variables. 


Proof. The relations (3.1) and transference make the Riemann mani- 
fold isotropic (projectively flat, of zero Schur constant). It is a well- 
known result that such a manifold admits loc: ly a group of dimension 
3n(n+-1), and the fact that we have the whoie group here restricts the 
group and the space to a few classical types,f all of which are included 
in the general statement of the theorem. However, in view of the fact 
that path-geometry is wider than that of the group-manifold, it may be 
as well to extend the concept of isotropy and the result of the theorem 
to paths of type (1.1) with only éa‘/ét = 0. This is done as follows. 


+ Garrett Birkhoff, Math. Zeits. 53 (1950), 226-35. 
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When there is a metric, a subspace has paths which can be derived as 
the extremals, in the coordinates y, which define the subspace by the 


equations = 4 — i/ya) (j = 1, 2,...,n; a = 1, 2,..., m). 


Without a metric, this cannot be done uniquely. But suppose that some 
system of paths i +B'y,9) = 0 

be given a priori in the subspace, with the additional property that 
every path of the subspace is a path of the original space, the subspace 
being thus a totally geodesic variety. 

THEOREM 3.2. For a totally geodesic subspace, the path-functions obey 
the relationship 

2! 9g 9° —2i BP+-ai{aly), ai, 9°} = 0. (3.2) 
Moreover, the corresponding differential invariants of the original space 
and the subspace are related} by 
Ty. tthe Te. tt +... (3.3) 
The proof is by direct substitution and elimination. 

Definition. A space is projectively flat if it admits (locally) a system of 
coordinates such that every subspace determined by y* = x? (i = 1.,..., m) 
and a! = 0 (i = m+1.,..., n) is totally geodesic. 

THEOREM 3.3. For a projectively flat space, the paths have the equations 

&'+a'db(a,%) = 0. (3.4) 
This again follows by direct substitution of the special coordinate values 
in the above relationship. 

The equations of variation define the collineations, i.e. the auto- 
morphism group preserving the paths, which is given by the vector 
generators u‘(x) that satisfy 

ul ;,ea¢*+-a'Ld = 0, where Ld = w$,+u",2*¢.,. (3.5) 

[t follows that Ld can at most have the form #’a,(x). Now the operator 

e/é#) permutes with L, and L is a differentiation for which Lz' = 0, 

regardless of the group generator employed. By differentiation of 
(3.5) twice in #, we have therefore 
(a) 2u* ;;,+8)a,+8,a; = 0, 


(6) —2(u* j..—U' j2%) = 85 (aj, )—% y.) +8}, 0; .—8j 4; x =0. (3.6) 


+ The quickest method of proving this statement is to calculate the differential 
invariants of (3.2) by regarding those equations as defining a path-geometry of 
a special type; ef. D. D. Kosambi, Quart. J. of Math. (Oxford), 19 (1948), 204-19. 





316 D. D. KOSAMBI 


In the second, we contract successively for 1—j and i—k, to obtain 
a;; = 0, and also u‘;;,; = 0. These results are summed up in 


THEOREM 3.4. The group generators of a projectively flat space with 
paths é'+-2'd(x,%) = 0 have the form ut = —}ata,a"+bi x’+c! where the 
constants a; must obey the further restriction Ld., = a,. 

This proves our statement? that the group of all automorphisms is a 
subgroup of the projective group in n variables. The full projective group 
is not admissible, for it has n(n+2) parameters, whereas the maximum 
number possible for any path-space (without reference to a metric) is 
only n(n+1), and this number is realized if and only if the space is flat, 
the paths then being straight lines and the group that of translations 
plus the full linear homogeneous group. When a Riemann metric is to 
be preserved, the parameters reduce to $n(n+1) in number, being the 
maximum admitted by any such metric. 

Regardless of the space being a topological group or not, we can derive 
conditions of isotropy for an affinely connected manifold. Here, 
$(x,z) = $,%", the vector ¢; depending only upon the coordinates 2, 
not upon the directions. The coefficients of connexion are 


je = 2085 $e +35 5)- (3.7) 
Calculating the curvature tensor defined by 
Bia = Vin —Vieg + Tie TET The (3.8) 


we find its components to be 


Bina = HOi($ie— 36x br) 9k br53 44; PV +8(G;,4—Fx.5)}- (3-9) 


J 

When a Riemann metric exists, we further obtain: 

(1) The vector ¢; is a gradient ¢ ,;, for ¢;;—¢;, = 0, from Rf, = 0. 

(2) Either the space is flat and the curvature tensor vanishes identi- 
cally, in which case ¢ = —log(a,x’+b)?; or, the curvature constant K 
not being zero, we have the canonical form g,;; = (4;,;—4¢,¢,)/2K. 

(3) From the vanishing of the covariant derivative of g;,, reducing to 
{exp(—¢)},;, = 0, we see when K + 0 that 6 = —log(a,;x'x/ + 2b; x'+-c). 

(4) A further reduction follows from g,; ~ 0, with 


acai — gfe See lat 





. K\S S? (3.10) 
S = a, x'x)+ 2b, x'+-c 


+ For n = 2, the general, non-parametric path-space can be shown, by direct 
calculation of all the groups, to admit only subgroups of the projective group ; 
ef. D. D. Kosambi, Quart. J. of Math. (Oxford), (2) 3 (1952), 1-11. 
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A linear change of coordinates 2‘ does not affect the special form 
of the paths. It follows (by suitable transformation) that S must be 
non-degenerate, for otherwise there would be a singularity at the origin; 
S must be definite, otherwise ds? would change sign at O for different 
directions. 

THEOREM 3.5. The group of automorphisms of an n-dimensional 
isotropic Riemann space is either that of translations and rotations (flat 
space), or equivalent to the projective group of a non-degenerate central 
quadric. 

It follows as a corollary that the only groups that are isotropic spaces 
(n > 3) are Abelian, the spaces being flat. 

For, in the non-flat case, we may reduce the quadric to > (x‘)?—1 = 0, 
by an always-permissible linear transformation. The generators of the 
group G@ constitute the Lie algebra of operators x‘p;—2/p, of rotation 8S, 
and the projections p;—2‘x’p,. The last do not form an ideal or a sub- 
algebra of the Lie algebra, the commutator of any two giving a rotation 
operator. Therefore the homogeneous space G/S cannot be a group; yet 
it is isomorphic to the original space. 

A theorem of Cartan and Schouten,t that the space of a simply 
transitive semi-simple group is an Einstein space, with g;; proportional 
to R;,;, includes the case of groups that are isotropic spaces. 

Every path-space (1.1) can be reduced to the restricted form (1.2) by 
taking 2° = ¢ as an extra coordinate and a new implicit parameter; but 
the process is not unique, while the varieties x® = constant have no 
meaning for the original space, corresponding to impulsive motion in 
dynamics, just as the restricted path-spaces give kinematic trajectories. 
On the other hand, we get a rather obvious inverse result that justifies 
the name ‘projectively flat’: 

THEOREM 3.6. The paths of a restricted projectively flat path-space 
reduce to straight lines d?x'/d(a")? = 0 by taking x" as a new parameter 


in place of t. 


[Added 8 May 1952} 

4. The foregoing gives some insight into the mechanism of locally 
Euclidean topological groups with (orientable) differentiable manifolds. 
Its essence has now to be recapitulated for path-spaces that are not 
assumed to be groups, for there may be some difference between the 
topological group and its representation as a path space; this is seen from 


t See the first footnote on p. 313. 
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the fact that a compact Abelian group is topologically a torus, which 
cannot be a space of zero curvature. The particular difficulty, of course, 
is resolved by taking a hypercube whose opposite faces are identified. 
For paths ¢'+a‘(x,z,t) = 0, there is always the parameter of total 
differentiation d/dt, with a corresponding exponential mapping: 


+ 





> a ee 
©) at a" 
a ,_,,4F td (dF 
(b) B(x, &,t) = F+t—- sala}? (4.1) 
ie 8B dol 
(c) #=fta —5—5(G) +~ 


In the last formal expansion the point x’ is reached by starting from 
the origin (chosen arbitrarily) in direction #, = a‘, > (a‘)? = 1, to get 
a locally Euclidean correspondence for some neighbourhood of O, and 
hence for the whole path-space. To permit this, we must first assume 
that the formal series either converges, or at least represents in some 
unique fashion the point x‘ in (4.1¢). The method, of course, is to build 
up a sum of successive small translations in the parameter, say by 
overlapping intervals, which give our formalism its meaning. Secondly, 
x‘(ta) must be solvable to give ta‘ = f(x), the ta‘ being then coordinates 
that map a sufficiently small neighbourhood of O into that of #,,. This 
means the vanishing of the two differential invariants a‘—}7".",, and 
6a" /6t. 

The coefficients F'} = éz'/éx; are obtained for the local correspondence 
0<—-2z by #' = Fia’. From differentiation of (4.1c) and the homo- 
geneity of a‘, we factor the formal series 
¢ (i 
2!\ dt 
From inverting (4.1c) for ta‘ in terms of x‘, we obtain Fi = Fi(x); it is 
seen that the homogeneity of a‘ permits the simultaneous elimination 
of ta‘ in (4.2) after a” has been factored out just once. The determinant 
| Fi| being unity at O will remain positive for ¢ sufficiently small, which 
makes the neighbourhood orientable. 

In this 0<—~ x correspondence, the tangent vector z‘ to the path- 
segment (necessarily assumed unique) joining O to x always corresponds 
to the initial direction a‘ of the path. Since O was arbitrary and we can 
always go as far as desired by overlapping neighbourhoods, this means 
geometrically that the paths are autoparallel lines. 


WE cn it ale 
x’ = a’—tay— 


= a8 HY = ar Fi, (4.2) 
0 
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As seen from the treatment by group theory, a change of parameter 
will in general be necessary to reduce the path-equations explicitly to 
a suitable form. This change will depend not only upon the point but 
also upon the direction; s being another parameter, ds/dt = (x, #), with 
% homogeneous of degree one in #. After such a change, a‘ with the new 
parameter must reduce to those of a Riemann space, hence in the first 
instance to quadratic polynomials in #. Direct calculation shows that 
this amounts to replacing a’ by a'+2a'd(x,%) (with a corresponding 
recalculation of the formal relation between coordinates 0 <—> x), where 
¢ is still homogeneous of the first degree in ¢. This is called a general 
projective change of parameter; such change must make the path-space 
a Riemann space. The only essential differential invariant of the path- 
space which is independent of such change is the generalized Weyl 


tensor 


i | } i r ~8 Jer 1 r ~$ Jer 
Wi, = Ri, + —— 8)( Ri, —#* Rigy) ——— 84 Bj, —# Rjur) 


Rinse 
(4.3) 
where 
Ri, — 4(Pi..— Pi.;), P; a pre. oat i +a 

THEOREM 4.1. A path-space with unique path-segments joining suffi- 
ciently near points admits a Riemann metric by projective change of 
parameter, and is completely characterized by the necessary and sufficient 
conditions da! /ét = 0, x'— fara, = 0, Wim = 0 (given also sufficient 
differentiability). 

The proof is obvious from the foregoing, because Wj, must be linear 
in , as for a Riemann space. We obtain a simple corollary, that the 
paths of any symmetric affine connexion may be made the geodesics of a 
Riemann space by suitable projective choice of parameter, provided that 
the paths have the property of uniqueness between two sufficiently near 
points. 

For this corollary, we can give a direct construction as proof. The 
contraction Rf,., is always a curl and may be made to vanish by suitable 
choice of the vector j; in adding }(3; u;,+8}u;) to T'j,, which is the effect 
of the projective change of parameter. We assume this to have been 
done, the vector 1; being then determined to within an additive gradient. 
This arbitrariness is used, if necessary, in making 

Gir Rhus +Gr; Bias = 9 
at the origin, where the tensor g;; can always be taken as 5;; at O. The 
general tensor g;; is now determined by the yosbocinhed 0<>2 
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correspondence and the tensor law of transformation, which makes the 
covariant derivative of that tensor vanish; in fact the above relation with 
the curvature tensor will hold at every regular point if it holds at the 
origin, and that is the integrability condition for g,;,, = 0. Finally, the 
determinant |g,;| will not vanish in general, since it was taken to be 
unity at O, and its partial derivative is a multiple of the contraction I%,. 
This seems to indicate that the projective curvature tensor, and not 
the Riemann—Christoffel tensor, should play the fundamental role in 
calculating overall invariants of path-spaces and group-spaces. 


[ Added 19 May 1952] 
The anchor-ring in F, is the closed compact S, of genus one 


x! = (a+bsin #)cos ¢, x? = (a+b sin @)sin 4, x? = beos@. 

The metric being 
ds* = b* d6@*+-(a+bsin 6)* d¢?, 
no coordinate system exists making all the geodesics simultaneously 
into straight lines. In E,, however, the intersection of two cylinders 
x! = acos8, x? = asin#@, x? = beos¢, x* = bsin¢d 

has the metric a? d6?+-b?d¢? of zero Gaussian curvature. It is only in 
this second form or its equivalent that the torus can represent a compact 
Abelian G,; not as the anchor-ring torus. In the large, the difference 
appears in the imbedding properties of the group-space. The argument 
extends obviously to higher dimensions. 

The developments of the first two sections give us, starting from a’ 
locally Euclidean group with suitably restricted paths, a Lie algebra of 
the infinitesimal collineations of the path-space into itself. The structure 
of this Lie algebra being determined entirely by the paths, hence by the 
original group, we may use the algebra to define an analytic group and 
manifold. It follows that the original topological group-space can be 
mapped upon the analytic manifold, again identifying the one-parameter 
groups and their cosets with the original paths. This exposes the 
structure of the Hilbert conjecture (fifth problem) for locally Euclidean 
groups that are also path-spaces; the problem is of little interest for 
path-geometry. 
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